Introduction to Topology 

Pure and Applied 


Colin Adams 

Williams College 


Robert Franzosa 

University of Maine 





The author and publisher of this book have used their best efforts in preparing this book. These efforts 
include the development, research, and testing of the theories and programs to determine their 
effectiveness. The author and publisher make no warranty of any kind, expressed or implied, with regard 
these programs or the documentation contained in this book. The author and publisher shall not be liable' 
any event for incidental or consequential damages in connection with, or arising out of, the furnishing, 
performance, or use of these programs. 

Pearson Prentice Hall™ is a trademark of Pearson Education, Inc. 

Original Edition entitled Introduction to Topology: Pure and Applied, First Edition, by Adams, Colin; 
Franzosa, Robert, Paul, published by Pearson Education, Inc, publishing as Prentice Hall, Copyright 
© 2008 

Indian edition published by Dorling Kindersley India Pvt. Ltd. Copyright © 2009 

All rights reserved. This book is sold subject to the condition that it shall not, by way of trade or otherwis 
be lent, resold, hired out, or otherwise circulated without the publisher’s prior written consent in any forn 
of binding or cover other than that in which it is published and without a similar condition including this 
condition being imposed on the subsequent purchaser and without limiting the rights under copyright 
reserved above, no part of this publication may be reproduced, stored in or introduced into a retrieval 
system, or transmitted in any form or by any means (electronic, mechanical, photocopying, recording or 
otherwise), without the prior written permission of both the copyright owner and the above-mentioned 
publisher of this book. 

ISBN 978-81-317-2692-1 

First Impression, 2009 

This edition is manufactured in India and is authorized for sale only in India, Bangladesh, Bhutan, 
Pakistan, Nepal, Sri Lanka and the Maldives. Circulation of this edition outside of these territories is 
UNAUTHORIZED. 

Published by Dorling Kindersley (India) Pvt. Ltd., licensees of Pearson Education in South Asia. 

Head Office:7* Floor, knowledge Boulevard, A-8(A) Sector-62, Noida (U.P) 201301, India. 

Registered Office: 14 Local Shopping Centre, Panchsheel Park, New Delhi 110 017, India. 


Printed in India by Pushp Print Services. 



Colin’s Dedication: 
To Alexa and Colton 


Bob’s Dedication: 
Alla Famiglia 




Contents 


Preface 11 

0 Introduction 19 

0.1 What is Topology and How is it Applied?.19 

0.2 A Glimpse at the History.25 

0.3 Sets and Operations on Them.27 

0.4 Euclidean Space. 32 

0.5 Relations .35 

0.6 Functions. 37 

1 Topological Spaces 42 

1.1 Open Sets and the Definition of a Topology. 42 

1.2 Basis for a Topology. 46 

1.3 Closed Sets. 56 

1.4 Examples of Topologies in Applications. 61 

2 Interior, Closure, and Boundary 73 

2.1 Interior and Closure of Sets. 73 

2.2 Limit Points. 78 

2.3 The Boundary of a Set. 83 

2.4 An Application to Geographic Information Systems. 86 

3 Creating New Topological Spaces 94 

3.1 The Subspace Topology. 94 

3.2 The Product Topology .100 

3.3 The Quotient Topology.106 

3.4 More Examples of Quotient Spaces .115 

3.5 Configuration Spaces and Phase Spaces.123 

4 Continuous Functions and Homeomorphisms 130 

4.1 Continuity.130 

4.2 Homeomorphisms.140 

4.3 The Forward Kinematics Map in Robotics.153 

5 Metric Spaces 161 

5.1 Metrics.161 

5.2 Metrics and Information .168 

5.3 Properties of Metric Spaces.173 

5.4 Metrizability .180 


7 





























8 Contents 


6 Connectedness 186 

6.1 A First Approach to Connectedness .186 

6.2 Distinguishing Topological Spaces via Connectedness.196 

6.3 The Intermediate Value Theorem.203 

6.4 Path Connectedness.209 

6.5 Automated Guided Vehicles.215 

7 Compactness 223 

7.1 Open Coverings and Compact Spaces.223 

7.2 Compactness in Metric Spaces.231 

7.3 The Extreme Value Theorem.238 

7.4 Limit Point Compactness.245 

7.5 One-Point Compactifications.250 

8 Dynamical Systems and Chaos 256 

8.1 Iterating Functions.256 

8.2 Stability.264 

8.3 Chaos.272 

8.4 A Simple Population Model with Complicated Dynamics . . .281 

8.5 Chaos Implies Sensitive Dependence on Initial Conditions . . 287 

9 Homotopy and Degree Theory 291 

9.1 Homotopy.291 

9.2 Circle Functions, Degree, and Retractions.295 

9.3 An Application to a Heartbeat Model.302 

9.4 The Fundamental Theorem of Algebra.306 

9.5 More on Distinguishing Topological Spaces.308 

9.6 More on Degree.313 

10 Fixed Point Theorems and Applications 323 

10.1 The Brouwer Fixed Point Theorem.323 

10.2 An Application to Economics.328 

10.3 Kakutani’s Fixed Point Theorem.336 

10.4 Game Theory and the Nash Equilibrium.343 

11 Embeddings 351 

11.1 Some Embedding Results.351 

11.2 The Jordan Curve Theorem.359 

11.3 Digital Topology and Digital Image Processing.366 

12 Knots 376 

12.1 Isotopy and Knots.377 

12.2 Reidemeister Moves and Linking Number.386 

12.3 Polynomials of Knots.393 

12.4 Applications to Biochemistry and Chemistry.399 

































Contents 9 


13 Graphs and Topology 408 

13.1 Graphs.408 

13.2 Chemical Graph Theory .416 

13.3 Graph Embeddings.422 

13.4 Crossing Number and Thickness.431 

14 Manifolds and Cosmology 441 

14.1 Manifolds.442 

14.2 Euler Characteristic and the Classification of Compact Surfaces 454 

14.3 Three-Manifolds .464 

14.4 The Geometry of the Universe.477 

14.5 Determining which Manifold is the Universe.481 

Additional Readings 485 

References 489 

Index 497 












Preface 


Topology is generally considered to be one of the three linchpins of modern 
abstract mathematics (along with analysis and algebra). In the early history 
of topology, results were primarily motivated by investigations of real-world 
problems. Then, after the formal foundation for topology was established 
in the first part of the twentieth century, the emphasis turned to its abstract 
development. However, within the past few decades there has been a significant 
increase in the applications of topology to fields as diverse as economics, 
engineering, chemistry, medicine, and cosmology. 

Our goals in this book are twofold: 

• To introduce topology as an important and fascinating mathematics dis¬ 
cipline; 

• To demonstrate the utility and significance of topological ideas in other 
areas of mathematics, in science, and in engineering. 

A reader of this book will learn the basics of point-set topology and will be 
introduced to follow-up topics such as knots, manifolds, dynamical systems, 
fixed points, and topological graphs. Furthermore, the reader will learn how 
results from topology are used in applications that range from the atomic scale 
in chemistry to the astronomic scale in cosmology. 

Intended Audience 

A minimal background in formal mathematics is required to get started in 
this text. A student who has successfully completed an introductory course 
in abstract mathematics should have sufficient background. In Chapter 0 we 
provide an overview of the background material concerning sets, relations, 
functions, the real line, and Euclidean space needed for the remainder of the 
text. A mathematically mature student who has completed a calculus sequence 
and is capable of independently learning the Chapter 0 overview should be 
sufficiently prepared for this text. We view this text as appropriate for a one- or 
two-semester introduction to topology at the undergraduate level. It can also 
be used for an introductory graduate course. 

How to Use This Text 

The core introduction to point-set topology is found in the following sections 
in the text: 1.1-1.3, 2.1-2.3, 3.1-3.4, 4.1, 4.2, 5.1, 5.3, 6.1-6.4, 7.1-7.3. 

We recommend covering these sections as a core for any course that uses 
this text. Much of the material in the remainder of the text uses many of the 
results from these sections. 

Beyond the core sections, we have attempted to make the remainder of the 
topics in the text as independent of each other as possible. In a table below, we 
discuss and diagram the dependence between these topics. 
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For each of the topology topics beyond the core introduction, and for 
each of the applications, our approach is to provide a short, meaningful, and 
interesting introduction. Our hope is to spark the reader’s interest in the topic 
or application and to plant a seed for further investigation. 

A one-semester introductory topology course could cover the core material 
from Chapters 1-7, along with a few of the additional topics that appear in 
those and the remaining chapters of the text. A subsequent course could take a 
variety of forms. It could proceed through additional topics or applications in 
the text. It could be done in a seminar format where the students investigate the 
additional topics or applications and give presentations about them. It could 
have a focus on the applications of topology and mainly cover that material 
from the text. We also feel that the additional topics and applications lend 
themselves well to independent study by students and would provide a good 
introduction to material to be explored further for an honors or master’s thesis. 

Chapter and Section Interdependency 

There are many pathways that can be followed through this text. Again, we 
recommend that the core introduction in Chapters 1-7 be covered as an intro¬ 
duction to point-set topology. On the next page there is a table of the book’s 
chapters. It should help areader or instructor when picking out topics to cover 
beyond the core introduction and when choosing an appropriate ordering of 
the topics. Here are some important points about the table: 

• The sections corresponding to the core introductory material are circled. 

• We do not address interdependency between Chapters 1-7, only interde¬ 
pendency between the sections within each of these chapters. 

• The core material from Chapters 1-7 is prerequisite for the additional 
topics in Chapters 8-14, but none of the material from the non-core 
sections in Chapter 1-7 is required in Chapters 8-14 (with the exception 
of a few exercises, in which we make reference to the needed material). 
Therefore in the table, we do not address how Chapters 8-14 depend on 
Chapters 1-7. 

• We address how Chapters 8-14 depend on each other, and within each of 
these chapters we also indicate how the sections depend on each other. 


Proofs 

In our development of topology as an abstract mathematics discipline, we 
provide the needed definitions, theorems, and proofs in their proper formality. 

In the core introduction in Chapters 1-7, all of the theorems (with the 
exception of the Urysohn Metrization Theorem) are either proven or have 
proofs that are assigned as exercises. The proofs of these theorems primarily 
follow in a straightforward manner from the definitions and prior theorems. 
In the proofs that we present in these chapters, we have tried to provide good 
examples that readers can use as models when formulating their own proofs 
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• The proof of the Jordan Curve Theorem in Section 11.2 requires 
the No Retraction Theorem and Theorem 9.14 from Section 9.2, 
as well as the Brouwer Fixed Point Theorem from Section 10.1. 
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in assigned exercises. Reading and understanding proofs, and developing and 
writing them, are essential aspects of learning mathematics. 

In Chapters 8-14, some of the theorems are presented without proofs, and 
some others are presented with proofs that exclude technical details. These 
omissions occur because the proofs or the details either require advanced tools 
beyond the scope of the text or are specialized for the setting and would take 
us too far off course to develop properly. We wish to keep the introductions 
to the topics in Chapters 8-14 relatively brief, and we wish to have them flow 
smoothly for the reader. In situations where proofs or details are excluded, we 
usually refer the reader to other sources where further information about the 
topic can be found. 

Topology is a highly visual discipline, and therefore illustrations are often 
beneficial to the understanding of a topic under investigation. We occasionally 
use illustrations to communicate parts of an argument in a proof. This practice 
is common in topology, and there are advantages and disadvantages to doing 
so. \n advantage is that it allows for a good flow of material in a book, a 
research work, or a professional presentation, reducing the details that need to 
be explicitly addressed. A disadvantage, particularly in an introductory text 
such as this, is that it can result in a perception that topology is mainly about 
pictures and is not as rigorous as other fields of mathematics. It is important to 
keep in mind that there is always formal mathematics behind the pictures, and, 
if need be, all of the necessary details can be provided to complete arguments 
that are represented by pictures. 

Illustrations 

As indicated previously, topology is very visual in nature. Therefore, we 
include many illustrations to aid the reader in understanding the material pre¬ 
sented. While the content of the illustrations should be clear from the context, 
we have adopted a few conventions that we use in the illustrations as consis¬ 
tently as possible. We introduce these conventions here, using mathematical 
terms that are defined within the text. 


General Topological Spaces: The topological space is the primary setting for 
the theory presented in the text. A rectangle with a black border and white 
interior usually represents a general topological space. For instance, Figure 1 
depicts a set A in a general topological space X. 



FIGURE 1: A set A in a topological space X. 
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The Plane: The plane is one of the main topological spaces used throughout 
the text. In illustrations, the plane is usually represented by a gray unbordered 
rectangle or parallelogram. For instance, Figure 2 depicts a set A in the plane— 
in one case viewing the plane directly, in the other case viewing the plane at 
an angle. 



FIGURE 2: Two views of a set A in the plane. 


Three-Space: Euclidean 3-space is also one of the main topological spaces 
used in the text. Often, it is left understood that an illustration depicts sets in 
3-space. Occasionally, though, to emphasize that the setting for an illustration 
is 3-space, a set of three-dimensional coordinate axes is included in the illus¬ 
tration. For example, Figure 3(a) depicts a sphere in either no particular setting 
or a setting that is understood from the context, while Figure 3(b) depicts a 
sphere in 3-space. 



Open Sets and Excluded Boundary Parts: Topological spaces are collections 
of open sets. In illustrations, an open set is represented by a set that is bordered 
by a dashed curve. This convention reflects the fact that an open set does not 
include its boundary. Dashed curves are also used to indicate that a portion of 
the boundary of a set is not included in the set. In Figure 4, set A is an open 
set in a topological space X, set B is an open set in the plane, and set C is a 
square-shaped planar set that includes its left and right edges, but not its top 
and bottom ones. 

Dashed curves are also used to indicate hidden or background curves in 
illustrations (as in Figure 3). Usually, these curves are thinner, are lighter gray, 
or have shorter dashes than the dashed curves used to represent open sets or 
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FIGURE 4: Sets A and B are open sets; set C does not include its top and bottom 
edges. 

portions of boundaries that are excluded. Again, it should be clear from the 
context what such curves represent. 

Excluded Points: Occasionally we consider sets that are obtained by removing 
points from other sets. In illustrations, the fact that a point is excluded from a 
set is usually depicted by using a small dashed circle as in Figure 5. 



FIGURE 5: A circle with a point removed, a plane with a point removed, and 3-space 
with a point removed. 
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Chapter 0 

Introduction 


0.1 What is Topology and How is it Applied? 

Topology is one of the most active areas in all of mathematics. Traditionally, 
it is considered one of the three main areas of pure mathematics (together 
with algebra and analysis). Recently, topology has also become an important 
component of applied mathematics, with many mathematicians and scientists 
employing concepts of topology to model and understand real-world structures 
and phenomena. In this book we introduce both aspects of topology: the 
traditional approach as an abstract discipline, and the emerging view as a 
valuable source of tools for applications. In the applied view, we see how 
topology impacts both real-world problems and a variety of results in other 
areas of mathematics. 

Topology grew out of geometry, expanding on some of the ideas and loos¬ 
ening some of the structures appearing therein. The word topology, literally, 
means the study of position or location. Topology is the study of shapes, 
including their properties, deformations applied to them, mappings between 
them, and configurations composed of them. 

Topology is often described as rubber-sheet geometry. In traditional ge¬ 
ometry, objects such as circles, triangles, planes, and polyhedra are considered 
rigid, with well-defined distances between points and well-defined angles be¬ 
tween edges or faces. But in topology, distances and angles are irrelevant. We 
treat objects as if they are made of rubber, capable of being deformed. We allow 
objects to be bent, twisted, stretched, shrunk, or otherwise deformed from one 
to another, but we do not allow the objects to be ripped apart. In Figure 0.1, 
we see four shapes that are very different from a geometric perspective, but are 
considered equivalent in topology. Any one of the four, if made of rubber, can 
be deformed to each of the others. 



FIGURE 0.1: To a topologist these four objects are equivalent. 

In Figure 0.2, we see two objects—the torus and the sphere—that are 
topologically distinct. We cannot deform a sphere into a torus in any permitted 
topological manner, and therefore they are not equivalent from a topological 
perspective. 
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It is often said that a topologist cannot distinguish between a doughnut and 
a coffee cup. The point is that in topology a coffee cup can be deformed into 
the shape of a doughnut. (See Figure 0.3.) These objects are equivalent as far 
as topology is concerned. 



FIGURE 0.3: A coffee cup and doughnut are topologically equivalent. 

You might wonder how this rubber-sheet geometry could be useful in real- 
world applications. The idea is very simple. Often, in a specific situation, 
the properties that are significant are those that are preserved when we treat 
an object as deformable, as opposed to those that are preserved when we treat 
that object as rigid. A topologist cannot distinguish between a doughnut and 
coffee cup, but what a topologist can do—and we will see how important this 
is throughout the text—is identify and use properties that these shapes have in 
common. 

Let us take a quick look at a few topics from topology and the role they 
play in some of the applications that we present in this text. All of the ideas 
that we introduce here are developed more fully in the chapters that follow. 

Topological Spaces and Phenotype Spaces: The objects that we study in 
topology are called topological spaces. These are sets of points on which a 
notion of proximity between points is established by specifying a collection of 
subsets called open sets. The line, the circle, the plane, the sphere, the torus, 
and the Mobius band are all examples of topological spaces. (See Figure 0.4.) 
In Chapters 1 and 3 we will see how these topological spaces and their open sets 
are specifically defined, and we will see how to construct topological spaces 
in a variety of settings and situations. 

The concepts of genotype and phenotype are important in evolutionary 
biology. Every living organism is the physical realization (phenotype) of in¬ 
ternally coded, inheritable information (genotype). Evolutionary change from 
one phenotype to another occurs by changes, called mutations, in their cor¬ 
responding genotypes. We would like to have a notion of how close one 
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FIGURE 0.4: A variety of topological spaces. 


phenotype is to another, reflecting how likely it is that a genotype mutation 
transforms one phenotype to the other. In Section 1.4 we describe how molec¬ 
ular biologists establish a notion of evolutionary proximity by constructing a 
topological space from a set of phenotypes. 


Interior and Boundary, and Region Relationships in a Geographic Infor¬ 
mation System: Given a set in a topological space, two important sets that 
we associate to it are its interior and boundary. For a set A in a space X, we 
intuitively think of the interior of A as the points that are surrounded by nearby 
points in A, and we think of the boundary of A as the points that lie arbitrarily 
close to both A and the set of points that lie outside of A. (See Figure 0.5.) We 
formally define interior and boundary in terms of open sets in Chapter 2. 



FIGURE 0.5: The interior and boundary of the set A. 

A geographic information system (GIS) is a computer system that stores 
and manipulates data that are geographic in nature. A GIS must be able to ad¬ 
dress queries such as, “Does the planned Coltonian landfill overlap the Alexan¬ 
dria conservation zone?” In order to do so, the GIS needs to be able to dis¬ 
tinguish a spectrum of relationships between land regions and needs to have a 
means for determining the particular relationship satisfied by a given pair of re¬ 
gions. In Section 2.4 we present a model for distinguishing such relationships 
based on the interiors and boundaries of sets. For example, if two sets A and 
B intersect only in their boundaries, then we say that A and B “touch” each 
other. In Figure 0.6 we illustrate that relationship and others that we identify 
in the model we present. 
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FIGURE 0.6: Possible relationships between sets A and B. 


Manifolds and Cosmology: An n -manifold is a topological space that locally 
resembles n-dimensional Euclidean space. For example, a 1-manifold locally 
resembles a line, a 2-manifold locally resembles a plane, a 3-manifold locally 
resembles 3-space, and so on. 

A 2-manifold is also called a surface. The sphere and torus are examples 
of surfaces. Each point in a surface lies in an open set that is topologically 
equivalent to an open set in the plane. (See Figure 0.7.) 



FIGURE 0.7: Locally a surface and a plane appear the same. 

Inhabitants of surfaces might believe that they actually live in a plane 
because what they see locally appears the same as if they were on a plane. (This 
notion was exploited in the entertaining nineteenth-century book, Flatland: A 
Romance of Many Dimensions, by Edwin Abbott.) However, if the surface 
inhabitants studied properties of their surface world, they might be able to 
deduce its overall shape and therefore be able to distinguish it from a plane. 

In a similar way, it is natural to feel that the universe in which we live is 
Euclidean 3-space because that is how we perceive it locally. Cosmologists try 
to determine the exact shape of the universe by studying properties that reveal 
aspects of its overall structure. In Chapter 14 we examine 3-manifolds, and 
we discuss approaches cosmologists are taking to determine which 3-manifold 
models the shape of our universe. 

Embeddings, Knots, and DNA: An embedding is a function that takes one 
topological space and places a copy of it within another topological space. In 
Figure 0.8 we illustrate the results of two embeddings of a circle in 3-space. 
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FIGURE 0.8: Embeddings of the circle in 3-space. 

These embeddings appear distinctly different. On the left, the embedded 
circle is knotted, while on the right it is not. The study of embeddings of the 
circle in 3-space is called knot theory. Determining the different types of knots 
and how they are related to each other is an important aspect of this area of 
topology. 

In knot theory, one of the fundamental operations that we perform on 
knots is a crossing change. As in Figure 0.9, we switch the crossing so that the 
understrand becomes the overstrand and vice versa. It is natural to ask what 
effect this operation has on a knot. 



FIGURE 0.9: The crossing change operation. 

We see this same operation occurring at the molecular level within the 
nucleus of the cell. Deoxyribonucleic acid (DNA) is a long, thin molecule 
made up of millions of atoms. This long strand is stuffed inside the nucleus 
of the cell, a packing comparable to placing 200 kilometers of tangled fishing 
line inside a basketball. 

In order for essential life processes to work, the biological machinery of 
the cell must be free to access and manipulate the DNA molecule. The cell’s 
ability to efficiently untangle DNA is therefore crucial to its survival. Inside the 
nucleus of the cell there are molecules, called enzymes, that act as biological 
tools. Some of these enzymes make crossing changes in the DNA, allowing it 
to untangle. An enzyme cuts the DNA strand at a place where it crosses over 
itself and then reattaches it so that the crossing is of the opposite type. Recently, 
new chemotherapy agents have been introduced that inhibit the enzymes from 
acting, thereby preventing cancerous DNA from re-creating itself. We discuss 
knot theory and this application in Chapter 12. 

Fixed Points and Economics: A function that maps a topological space to 
itself has a fixed point if there is a particular point in the space that is mapped 
to itself by the function. For example, point p in Figure 0.10 is a fixed point 
of the function / illustrated there. 

Fixed-point theory is an important area of topology; it involves the inves¬ 
tigation of questions such as “Which functions mapping a topological space to 
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FIGURE 0.10: The function / has a fixed point at p. 

itself have a fixed point?” and “Which topological spaces are such that every 
continuous function mapping the space to itself has a fixed point?” The most 
well-known result in this area is the Brouwer Fixed Point Theorem. It asserts 
that every continuous function on a closed n-dimensional ball has a fixed point. 
For instance, it says that in dimension one every continuous function from a 
closed interval to itself has a fixed point, and, in dimension two, every continu¬ 
ous function from a disk to itself has a fixed point. (See Figure 0.11.) We prove 
these one- and two-dimensional results in Chapters 6 and 10, respectively. 


/ 



FIGURE 0.11: Continuous functions mapping the interval to the interval or the disk 
to the disk must have a fixed point. 

In a model economic system there are consumers and goods and a set of 
variables associated with them, including the supply of the goods, the price of 
the goods, and the demand for the goods. In general, the values of the variables 
change in the near future, depending on their values at the present time. For 
example, a current short supply of applied topology texts might result in a 
higher price for them in the near future. Economists are interested in knowing 
if an economic system can be in equilibrium, a state where the consumers are 
appropriately satisfied and there are an unchanging supply, price, and demand 
associated with each of the goods. In Section 10.2 we apply the Brouwer Fixed 
Point Theorem in such an economic model and demonstrate that equilibrium 
states are possible., 


The conclusion of the Brouwer Fixed Point Theorem is an existence result. 
It asserts the existence of a fixed point, but it does not indicate where it is 
located. This is the nature of many of the results encountered in topology and 
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employed in applications. It is a demonstration of the qualitative nature of the 
conclusions derived using topological tools. 

Since quantities such as distance and angle measure are irrelevant in topol¬ 
ogy, we do not expect to extract quantitative results in applications of topology. 
In general, topology aids in the qualitative analysis of a system. It helps identify 
possibilities or impossibilities that arise as a result of underlying structures. 

There are three important components to an application of topology: 

(i) Identification of the pertinent topological structures and relation¬ 
ships. 

(ii) Proper definition of the topological concepts and rigorous verifica¬ 
tion of the topological results. 

(iii) Application of the topological concepts and results to draw con¬ 
clusions. 

As a result, throughout the book we will have a threefold view of topology: 
a broad view, a theoretical view, and an applied view. 

In the broad view, we intuitively regard topology as a general study of 
shapes and their properties. These are the topological aspects of applied prob¬ 
lems that we identify and examine as we study real-world systems and draw 
conclusions about them. 

The theoretical view of topology is the classic view of topology as an 
abstract mathematics discipline. We build this theory from its foundation so 
that we can properly define the concepts and structures that we investigate in 
the broad view and so that we have an appropriate theoretical basis to support 
the tools and relationships that we employ in the applied view. 

Finally, in the applied view we study applications of topology involving 
the relationships and results that we develop. The examples discussed previ¬ 
ously provide an initial look at the types of real-world applications we will 
consider. Furthermore, we examine how topological concepts help to define 
and reveal important structures and relationships in other areas of mathematics. 
For example, we identify topological properties underlying the Fundamental 
Theorem of Calculus in Chapter 7, we use topological concepts to define chaos 
in dynamical systems in Chapter 8, and we use topological results to prove the 
Fundamental Theorem of Algebra in Chapter 9. 

0.2 A Glimpse at the History 

Topology is popularly considered to have begun with Leonhard Euler’s (1707- 
1783) solution to the famous Konigsberg bridges problem. Let us take a quick 
look at this problem. 

In the eighteenth century, the river Pregel flowed through the city of 
Konigsberg, in Prussia (now Kaliningrad, in Russia), dividing it into four sep¬ 
arate regions. There were seven bridges that crossed the river and connected 
the regions as illustrated in Figure 0.12. 

A favored pastime was to take a bridge-crossing stroll through Konigsberg. 
People asked, “Can you take a stroll through the city, crossing each bridge 
exactly once?” Curiously, no one was able to find a way to do so. 
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FIGURE 0.12: The Konigsberg bridges. (Adapted from an illustration in [New].) 

The problem caught the attention of Leonhard Euler, who realized that it 
involved a new mathematical approach, called “the geometry of position.” He 
wrote: 

Recently, there was announced a problem which while it certainly 
seemed to belong to geometry, was nevertheless so designed that 
it did not call for the determination of magnitude, nor could it be 
solved by quantitative calculation; consequently I did not hesitate 
to assign it to the geometry of position, especially since the solution 
required only the consideration of position, calculation being of no 
use. [New] 

Euler analyzed the problem and proved that it was impossible to accom¬ 
plish the desired bridge-crossing walk, given the configuration of regions and 
bridges. We examine this problem further in Section 13.1. 

Following the initial work of Euler, a number of prominent mathematicians 
made valuable contributions to the geometry of position over the next century 
and a half. These included Carl Friedrich Gauss (1777-1855), August Ferdi¬ 
nand Mobius (1790-1868), Johann Listing (1808-1882), Bernhard Riemann 
(1826-1866), Felix Klein (1849-1925), and Henri Poincare (1854-1912). 

The term “topology” first appeared in the title of Listing’s paper, “Vorstu- 
dien zur Topologie” in 1847, but the term was not commonly used and the disci¬ 
pline was not formally defined until several decades later. “Analysis situs” was 
the expression primarily used for this area of geometry, and in the introduction 
to his 1895 paper “Analysis situs,” Poincare wrote of the geometry-of-position 
philosophy: 

The proportions of the figures might be grossly altered, but their 
elements must not be interchanged and must conserve their relative 
situation. In other terms, one does not worry about quantitative 
properties, but one must respect the qualitative properties, that is 
to say precisely those which are the concerns of Analysis Situs. 

[Sar] 
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Many nineteenth-century efforts in the geometry of position were moti¬ 
vated by applied problems, including James Clerk Maxwell’s (1831-1879) and 
Peter Guthrie Tait’s (1831-1901) work on knots (arising from investigations in 
chemistry), Gustav Kirchoff’s (1824-1887) study of electrical networks, and 
Poincare’s analysis of celestial mechanics. 

In the late nineteenth- and early twentieth-century there were numerous 
contributions to the growing discipline that would soon become the field of 
topology. L. E. J. Brouwer (1881-1966), Georg Cantor (1845-1918), Mau¬ 
rice Frechet (1878-1973), Felix Hausdorff (1868-1962), Poincare, Frigyes 
Riesz (1880-1956), and Hermann Weyl (1885-1955) were some of the math¬ 
ematicians involved. Hausdorff’s 1914 book Grundziige der Mengenlehre 
(Fundamentals of Set Theory) introduced an axiomatic foundation for topo¬ 
logical spaces and thereby initiated the general study of topology as an abstract 
mathematics discipline. 

Throughout most of the twentieth century, topology developed primarily 
as a branch of abstract mathematics. However, in recent years there has been 
dramatic growth in the applications of topology. We explore topology from 
both of these perspectives in the chapters that follow. 

In the remaining sections of this chapter, we review some basic topics in¬ 
volving sets, the real number system, Euclidean space, relations, and functions. 
These topics play an important role throughout the text. More detailed devel¬ 
opments of this material can be found in introductory abstract mathematics 
texts such as [Bio] and [Hum]. 

0.3 Sets and Operations on Them 

In this section we consider sets, relationships between them, and operations on 
them. 

Throughout the text we employ standard set-builder notation for defining 
sets. We use the term elements to refer to members of sets, and we write 
x € A to mean x is an element of set A. Most of the sets that we encounter in 
topology are considered to be sets of points, so in those circumstances we use 
elements and points interchangeably. The set containing no elements is called 
the empty set; we denote it by 0. 

We denote the real line or the set of real numbers by R. Important subsets 
of M that we employ are the positive real numbers R+, the integers Z, the 
positive integers Z+, and the rational numbers Q. 

We assume all of the basic algebraic and order properties of the real number 
system. 

One such property states that between every pair of real numbers there is an 
irrational number and a rational number. This property is useful in exploring 
the topology of the line. It follows that arbitrarily close to every real number 
there is a rational number. Consequently the set of rational numbers is said 
to be dense in R. (See Section 2.1.) Similarly, the set of irrational numbers is 
dense in R. 

Another important property is that every real number is an element of an 
interval of the form [«,« + !] where n is an integer. 
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A fundamental defining property of the real number system is the least 
upper bound property: Every subset of M that is bounded from above has 
a least upper bound. Equivalent to this property is the greatest lower bound 
property: Every subset of M that is bounded from below has a greatest lower 
bound. We denote the least upper bound and greatest lower bound of a set A 
by lub(A) and glb(A), respectively. 

We use the following standard notation for intervals in M: 


(a,b) 
[a,b) 
(a, oo) 
(- 00 , b) 


{x e R | a < x < b) [a, b] 

[x e R. | a < x < b) ( a,b ] 

[x e M | a < jc) [a, oo) 

[x e R. | x < b) (— oo, b ] 


{jc e R | a < x < b] 
{x 6 R | a < x < b) 
[x e R | a < x} 

{x G R | x < b] 


The intervals ( a , b), (—oo, b), and (a, oo) are called open intervals; the 
intervals [a,b], (—oo, b], and [a, oo) are called closed intervals; and the 
intervals [a, b) and (a, b] are called half-open intervals. Intervals of the form 
[a, b] are also referred to as closed bounded intervals. 

For sets A and B, we say that A is a subset of B, denoted A C B, if every 
element of A is also an element of B. Sets A and B are equal, written A — B, 
if both A C B and B C A. If A C B but A does not equal B, then we say that 
A is a proper subset of B. 

We frequently work with collections of sets, particularly indexed collec¬ 
tions of sets, defined as follows: 


DEFINITION 0.1 . Let D be a set, and assume thatfor each element d in D we 
have a set Sj. The collection C consisting of the sets Sd, one for each d € D, 
is called an indexed collection of sets or a collection of sets indexed by D. 
We refer to D as the indexing set for the collection. We denote the collection 
C by [Sd)deD or just {5^} when the indexing set is clear or does not need to be 
specified. 


EXAMPLE 0.1 . Let T be the set of towns in Arkansas, and for each t e T, let 
B t denote the set of barbers who run a business in town t. Note that two sets B t 
and B,' have a barber in common if there is a barber who runs a business in both 
town t and town t'. The collection {B t } te T is a collection of sets of barbers, 
with each set in the collection representing the barbers who run a business in 
a town in Arkansas. 


A special case of an indexed collection of sets {SdjdeD occurs when the 
indexing set is a set of positive integers, D = [m, m + 1,... , n}. In that case 
we also express the collection as { SdYd =m • i nc i u< d e here the possibility that 
m — —oo or n = oo. 

DEFINITION 0.2. A collection of sets {SdYd- _ m is said to be nested if Sd+\ C 
Sdfor all d. 


EXAMPLE 0.2. The collection of intervals [—£, £], with n € Z+, is a nested 
collection of sets. 
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DEFINITION 0.3. 

(i) Let U and V be sets. The union ofU and V is the set 

U U V = [x | x € U or x e V}, 
and the intersection of U and V is the set 

UC\V = {x\xeU andx e V}. 

(ii) Let [Ud)deD be an indexed collection of sets. Then the union of 
the sets Ud is the set 

(^J Ud = [x | x € Ud for some d e D}, 
deD 

and the intersection of the sets Ud is the set 

|^| Ud — {x | x € Ud for all d e D }. 
deD 

We also denote these unions and intersections by (J Ud and ("] Ud, 
respectively. We refer to this type of union as an arbitrary union 
and this type of intersection as an arbitrary intersection. 


EXAMPLE 0.3. Let {B,} fe r be the collection of sets of barbers from Exam¬ 
ple 0.1. Then |J deD Ud is the set of barbers who run a business in some town 
in Arkansas. And if Q d eD Ud is not the empty set,- then each barber in this 
intersection runs a business in every town in Arkansas. 


EXAMPLE 0.4. For each r in K+, let U r = [— r, r]. Then (JreR+ Ur — R 
an d f\eR + Ur = {0}. 


If the indexing set for { Ud D is a set of integers, D = [m,m + l ,...,«}, 
then we also express the union and intersection of the sets in the collection as 
U n d=m Ud and n^=m Ud, respectively. In the case where neither m = — oo 
nor n = oo, we refer to these as a finite union and a finite intersection. 

DEFINITION 0.4. 

(i) Let A and B be sets. If A f) B = 0, then we say that A and B are 
disjoint. 

(ii) If {Ad} deD is a collection of sets, and every pair of sets in the 
collection is disjoint, then we say that the sets in the collection are 
mutually disjoint. 


The following lemma will be helpful to us when proving that a set is a 
union of a particular collection its subsets: 
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LEMMA 0.5. The Union Lemma. Let X be a set and C be a collection 
of subsets of X. Assume that for each x € X, there is a set A x in C such that 
x € A x . Then UxeX = (See Figure 0.13.) 



FIGURE 0.13: The set X is the union of the subsets A x . 

Proof We prove that |J A x c X and X c ij A x . First, since each A x 
is a subset of X, it follows that (J Ajc CZ 

Now suppose that y e X. There exists A y in C such that y e A y . 
Thus y € A y C IJ A x . Since yeX implies that y € EJ A x , it follows 
that X C IJ A x . Hence A x = X. ■ 


EXAMPLE 0.5. Consider the collection of intervals in K given by C — 
{[j, 5 4- l]} se z- Every x € K is an element of [s, s + 1] for some s e Z, 
and therefore the Union Lemma implies that K is the union of the intervals in 
C. (See Figure 0.14.) 

-] f- 1 E- i E-] 

E- i E-] E- ] E- 

-,- 1 - \ - 1 - 1 - 1 - 1 - 

-3-2-10 1 2 3 

FIGURE 0.14: The set of real numbers is the union of all intervals of the form 
[s, s + 1] for s e Z. 


DEFINITION 0.6. Given two sets A and B, we define the complement of B 
in A to be the set A — B = {x \ x e A and x & B). 

The complement of B in A is a subset of A. For example, if A c B, then 
A — B = 0, and if A and B are disjoint, then A — B = A. (See Figure 0.15.) 

In subsequent chapters, we often work within a fixed set X. In that case, 
given a subset B of X, we say “the complement of B” instead of “the comple¬ 
ment of B in X”. 

The complement of B in A is also called the difference between A and 
B and sometimes is denoted A\B in mathematics literature. 
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FIGURE 0.15: Various possibilities for the complement of B in A. 


DEFINITION 0.7. Given two sets A and B, we define the product of A and 
B to be the set 


A x B = {(a, b) | a e A, b e B}. 

The elements of A x B are ordered pairs of elements, one from A and one 
from B. 


IMPORTANT NOTE: The expression (a, b) is used to denote both an interval 
in M and an element in a product Ax. B. It should be apparent from the context 
which meaning is intended. 

Notice that if A has m elements and B has n elements, then the product 
A x B has mn elements. We also work with finite products of sets, defined as 
follows: 

DEFINITION 0.8. The product of sets A\, A 2 ,... , A n is the set 

Ai x A 2 X ... x An = {{ai,a 2 ,... ,a n ) | a/ e A iy fori = 1,2. n}. 

The elements of the product are ordered n-tuples of elements from the sets Ai. 

The following are some important relationships associated with sets and 
the aforementioned relations and operations. Throughout the text we use these 
and other similar relationships. 


THEOREM 0.9. For sets A, 

Distributive Laws: 

AD(B 
AU(B 
A x (B 
A x (B 
A x (B 

DeMorgan’s Laws: 

A — (B 
A — (B 


= (A n B) u (A n C) 
= (A U B) n (A U C) 
= (A x B) IS (A x C) 
= (A x B) n (A x C) 
= (A x B) - (A x C) 

= (A — B) n (A — C) 
= (A — B) U (A — C) 


B, and C, the following laws hold: 


UC) 

nc) 

UC) 

nc) 

-c) 

UC) 

DC) 
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0.4 Euclidean Space 

One of the most common sets used in topology is R n or Euclidean n-space. 
The plane, denoted R 2 , is the set of ordered pairs of real numbers, 

R 2 = K*i,* 2) I *i,*2 e R}- 

Thus R 2 is the product, R x R. (See Figure 0.16.) 



FIGURE 0.16: The plane JR 2 = {(*i, * 2 ) I x\, e R}. 

In general, R" is the product of n copies of the real line. It is the set of 
n-tuples of real numbers, 

R" = l(xi,X 2 ,... ,x„ ) 1*1,*2. x„ gR}. 

We call R" Euclidean n-spaceorjustn-space for short. Thepoint(0,0, ...,0) e 
R" is called the origin and we denote it by O. 

The standard means for measuring distance in R" makes use of the Eu¬ 
clidean distance formula, defined as follows: For p = ( p\ . p n ) and 

q — iqi,... , q n ), the distance between p and q is 

dip, q) = yjip\ ~ q\) 2 + ■ ■ ■ + ip n ~ qn) 2 , 
as illustrated for R 2 in Figure 0.17. 

The Euclidean distance formula satisfies three important properties that 
make it what is known as a metric: 

(i) For all p,q G R", d(p, q) > 0 and dip, q) = 0 if and only if 
p = q. 

(ii) For all p,q G R”, dip, q) = diq, p). 

(iii) For all p, q, r G R", dip, r) < dip, q) -I- diq, r). 

We discuss metrics and their relation to topology in Chapter 5. We regard 
each x in R" as a point in n -space, but we can also regard it as a vector based 
at the origin in n -space. Either way, for each x = ix\,... , x n ) we define the 
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FIGURE 0.17: Measurina distance in R 2 . 


norm of jc by |jc | = ^Jx\ + ... + x%. We can think of the norm of x as either 
the distance from the origin to x or the length of x as a vector. 


DEFINITION 0.10. A set A e M" is called bounded if there exists b € R 
such that |jc| < b for all x e A. 

If a set A € R” is bounded, then there exists d* e R such thald(p, q) < d* 
for all p,q e A. Thus, in a bounded set A in R' ! , the distance between all pairs 
of points in A is bounded from above. 

DEFINITION 0.11. Aset A C R" is called convex if for every p,q e A the 
line segment between p and q lies in A. 

For example, in Figure 0.18 sets A and B are convex subsets of the plane, 
but set C is not since there are pairs of points in C such that the line segments 
between the points do not lie in C. 



DEFINITION 0.12. 

(i) A half space in R” is a set of points (;ci,... , x n ) satisfying a linear 
inequality a\x\ + ... + a n x n < b where b and each a,- are real 
numbers and at least one of the a,- is nonzero. 

(ii) A polyhedron in R” is a bounded subset ofR n that can be expressed 
as an intersection of half-spaces. 
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In Figure 0.19 we show one-dimensional, two-dimensional, and three- 
dimensional polyhedra. A one-dimensional polyhedron is a line segment. A 
two-dimensional polyhedron is a polygon. Its perimeter is made up of vertices 
and edges. A three-dimensional polyhedron has a surface made up of polygonal 
faces that meet at edges and vertices. 



FIGURE 0.19: One-dimensional, two-dimensional, and three-dimensional polyhe¬ 
dra. 


DEFINITION 0.13. The n-sphere, denoted S n , is the set of points that are 
distance 1 away from the origin in R" +1 . Therefore, 

S n = {(* 1 ,... ,-Cn+i) e M n+1 | xf + ... + x 2 +] = 1}. 

The 1-sphere, S 1 , is also called the circle, and the 2-sphere, S 2 , is referred to 
simply as the sphere. (See Figure 0.20.) 



FIGURE 0.20: The 0-sphere, 1-sphere (circle), and 2-sphere. 


If x = (jci, ... , x n+ i) is a point on the n-sphere, then the point —x = 

(—X] . —x n +i) is called the antipode of x or the point antipodal to x. 

Thus, the antipode of x is the point on the n-sphere that is opposite x, through 
the origin. 


DEFINITION 0.14. Define B n , the n-ball, to be the set 

B n = {(*i. x n ) e M" | xf + ... +xl < 1). 
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The 2-ball is referred to as the disk. Further, define B n , the open n-ball, to be 
the set 

B n - {(* 1 ,... ,x„) € K" \x\ + ...+xl < 1}. 

The open 2-ball is referred to as the open disk. 

0.5 Relations 

DEFINITION 0.15. Given sets X and Y, 

(i) A relation between X and Y is a subset R of the product X x Y. 
For a relation R, we say that x is related to y, denoted xRy, if 
(x, y) e R. 

(ii) A relation on X is a relation from X to X. 

An important type of relation used extensively in mathematics is the equiv¬ 
alence relation, defined as follows: 

DEFINITION 0.16. A relation ~ on X is called an equivalence relation if 
the following properties hold for all a,b,c in X: 

(i) a ~ a (reflexivity), 

(ii) a ~ b implies b ~ a (symmetry), 

(iii) a ~ b and b ~ c imply a ~ c (transitivity). 


EXAMPLE 0.6. Let D be the set of all dogs in Dubuque, Iowa, at this instant. 
Let a ~ b if dog a has the same mother as dog b. We show that ~ is an equiva¬ 
lence relation on D. Certainly each dog is related to itself, so reflexivity holds. 
Also, if Butterscotch has the same mother as Bingo, then Bingo has the same 
mother as Butterscotch. Thus symmetry holds. Finally, if Butterscotch and 
Bingo share a mother and Bingo and Rosco share a mother, then Butterscotch 
and Rosco also share a mother. Hence transitivity holds as well, and it follows 
that ~ is an equivalence relation. 


EXAMPLE 0.7. Once again, let D be the set of all dogs in Dubuque, Iowa, 
at this instant. But now define a relation L by a Lb if dog a has ever licked 
dog b. Although it is not a mathematical certainty, it is most likely true that 
reflexivity holds for this relation. Every dog has licked itself at one time or 
another. But the fact that Bucky licked Daisy does not mean that Daisy licked 
Bucky back, so symmetry does not necessarily hold. Similarly, Bucky having 
licked Daisy and Daisy having licked Zoe does not imply that Bucky ever 
licked Zoe. Thus transitivity need not apply either. Therefore L is a relation 
that is not necessarily an equivalence relation. 
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EXAMPLE 0.8. OnR 2 define (p\, pf) ~ (qi, qi) if P\ + p 2 = q\ + qi- We 
show that ~ defines an equivalence relation. 

(i) Since p\ + p 2 = p\ 4 - P 2 , it follows that (p\, P 2 ) ~ (pi, pi) for 
all (pi, P2) in R 2 . 

(ii) If (p\,p2) ~ (q\,qi) then P] + p 2 = q[ + q 2 , implying that 
q\ + <72 = Pi + P2> and therefore (q\,q 2 ) ~ (p\, p 2 ). Thus for 
all (pi,pi), (q\,qi) in R 2 , if (p\, pi) ~ (q\, qi), then (q\,q 2 ) ~ 
(P\,P2)- 

(iii) Finally, assume (p t , p 2 ) ~ (q\ , q 2 ) and (q\,q 2 ) ~ (r\,r 2 ). Then 

p\ + P2 - q\ + <?2 = n + r 2 , 

implying that (p\, p 2 ) ~ (r\,r 2 ). Hence for all (p\,p 2 ), (qi,qi), 
and (n, r 2 ) in R 2 , if (p \, p 2 ) ~ (q\,qi) and (q\,q 2 ) ~ (n, r 2 ), 
then (p\,p 2 ) ~ ( r u r 2 ). 

It follows that ~ is an equivalence relation. 

Now, given a point (tzi, a 2 ) in R 2 , can we identify all of the points that 
are equivalent to it? For (jci , x 2 ) to be equivalent to (a\, a 2 ), we must have 
x\ + x 2 = a\ + a 2 . Thus all of the points (x\,x 2 ) in the plane that are 
equivalent to (ai, a 2 ) lie on the line x\ +' x 2 . — a\ + a 2 . Furthermore, given 
any line xi + x 2 = c, every point on the line is equivalent to every other point 
on the line. 


The collection of lines described in Example 0.8 partitions the plane into 
separate subsets. We formalize this idea in the following definition. 

DEFINITION 0.17. A partition of a set X is a collection of mutually disjoint 
subsets of X whose union is X. 

As in Example 0.8, an equivalence relation ~ on a set X determines a 
partition of the set. For each x e X we define [x] = {p e X | p ~ x}. We 
call [x] the equivalence class of x under it is the set of all elements of X 
that are equivalent to x under the equivalence relation ~. The collection of all 
equivalence classes under ~ is a partition of X. For the equivalence relation 
in Example 0.8, the equivalence classes are the lines x\ + x 2 = c in R 2 , as 
illustrated in Figure 0.21. 

We can also reverse this process. A partition of a set X defines an equiv¬ 
alence relation on X. We consider two elements of X to be related if they are 
contained in the same subset of the partition. This yields an equivalence rela¬ 
tion and the resulting equivalence classes are the original sets in the partition. 


EXAMPLE 0.9. Returning to Example 0.6, we immediately see that the 
equivalence relation defined there divides the population of dogs in Dubuque 
into subsets where each subset is the set of all dogs in Dubuque that are offspring 
of a particular mother. 
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FIGURE 0.21: A partition of the plane into lines x\ + xi = c. 


0.6 Functions 

As with other areas of mathematics, functions play an important role in topol¬ 
ogy. In this section we review basic definitions and properties related to func¬ 
tions. 

DEFINITION 0.18. Let X and Y be sets. A function f from X to Y is a 
relation between X and Y that associates to each x in X a unique y in Y. We 
write f(x) = y and call y the image of x under f. The set X is called the 
domain of f, and the set Y is called the range of f. 

We use the notation / : X -> Y to indicate that / is a function from X to 
Y. We also call a function f : X Y a mapping and say that / maps X to 
Y. Further, if f(x) = y, then we say that / maps x to y. 

DEFINITION 0.19. For a function f : X —► Y and a subset A of X, define 
the image of A under f to be the set 

f(A) = {y 6 Y | y = f(x) for some x e A], 

as illustrated in Figure 0.22. The set f (X) is the image of the domain; we also 
refer to it as the image of f. 



FIGURE 0.22: The image of A under /. 
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A function / : X -»■ Y is called a constant function if its image consists 
of a single point c e Y. In that case f(x) = c for all x e X. For a set X, the 
identity on X is the function idx : X —► X, defined by idx(x) = x for all 
x & X. The identity on X just maps each point x in X to itself. 

DEFINITION 0.20. Given f \X-+Y and a pointy e Y, define f~ l (y), the 
preimage of y, to be the set [x e X \ f(x) = y}. Furthermore, given a subset 
W ofY, define f~ l (W), the preimage of W, to be the set {x e X | f(x) e IV}. 


For a function / : X -> Y, the preimage of a point y is the set of points in 
X that / maps to y, and the preimage of a set IV is the set of points in X that 
/ maps to points in IV. 


EXAMPLE 0.10. If / : R. -*■ R is defined by f(x) = x 2 , then / 1 ([0, 2]) = 
[-V?, n/ 2] and /-!([ 1, 3]) = [-V3, -1] U [1, V3]. 


The following theorems state some basic relationships between images and 
preimages and operations on sets. We use these and other similar properties 
throughout the text. 

THEOREM 0.21. If f : X —► Y is a function and A and B are subsets ofX, 
then 


(i) f(A U B) = /(A) U /(B). 

(ii) f (A n B) c /(A) n /(B). 

(Hi) /(A) - /(B) c f(A - B). 

THEOREM 0.22. If f : X -» Y is a function and V and W are subsets of 

Y, then 

(i) f~\V U W) = f~\V) U f~\W). 

(ii) f~\v n W) = f-\v) n f-'(W). 

(Hi) f~\v -w) = f~\v) - r\w). 

DEFINITION 0.23. 

(i) A function f : X —»■ Y is said to be one-to-one or injective if 
f(w) = f(x) implies w = x for every w, x e X. Thus f is 
injective if it maps distinct pairs of elements of X to distinct pairs 
of elements ofY. 

(ii) A function f : X -*■ Y is said to be onto or surjective if f (X) = Y. 
That is, f is surjective if for every y in Y there is an x in X such 
that f maps x to y. 
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(iii) A function that is both injective and surjective is called bijective or 
a bijection. 

Being related by a bijection is the basic equivalence between sets. A 
bijection / : X -» Y establishes a one-to-one correspondence between the 
elements of X and the elements of Y. 


EXAMPLE 0.11. Define / : R. -> R by f(x) = x 2 . Then / is not suijective 
since no negative numbers are in the image of /. Furthermore, / is not 
injective, since 1 ^ —1, but /(1) = /(— 1). The image of / is the set 
M+ U {0}. 


EXAMPLE 0.12. Define / : R -*■ R by fix) = 3x + 1. We show that this 
function is both injective and suijective. Hence, it is a bijection. 

First, suppose that w, x e M and f(w) = fix). Therefore 3u> + 1 = 
3x + 1, implying that 3 w = 3jc, and thus w = x. Since flw) = fix) implies 
w = x for w, x 6 M, it follows that / is injective. 

Now suppose that yet. Let x = Then 


fix) = 3(^±) + 1 = y - 1 + 1 = y. 

Thus for each yel there exists ul such that fix) = y, implying that / 
is suijective. 


EXAMPLE 0.13. Define g : R ->• E 2 by glx) = lx, x). Then g is injective 
because glw) = glx) implies that (u>, w) = lx, x), and therefore w — x. But 
g is not suijective because, for example, (0,1) is not in the image of g. 

_i 


EXAMPLE 0.14. Define h : M 2 —> MbyA(xi , X 2 ) = x\ . Then h is surjective 
because for x € R we have x = hlx, 0). On the other hand, h is not injective 
because hi 0,0) = hi 1,0), but (0,0) ^ (1,0). 


To every bijective function we associate an inverse function, defined as 
follows: 

DEFINITION 0.24. If f : X -+ Y is a bijection, then we define a function 
f~ l :Y—>X, where / _1 (y) is the unique x e X such that fix) = y. The 
function f~ l is called the inverse function to f. 
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EXAMPLE 0.15. In Example 0.12 the function / : R -» R, given by/(x) = 
3x + 1, is a bijection. The inverse function to / is the function f~ l : R -»• R 
defined by 


r'w 


x — 1 
3 


j 


Sometimes we need to restrict a function / : X -» T to a subset A of the 
domain X. This is accomplished as follows: 


DEFINITION 0.25. Given f : X —> Y and a subset A of X, the restriction 
of f to A is the function f\& : A —► Y defined by /UM = fix) for each 
x € A. 


EXAMPLE 0.16. Consider/ : R -»• [—1,1], defined by fix) = sin(x),and 
let A = [^, |-] c R. Note that / is not injective, but /U is. In fact f\& is 
bijective, and its inverse function is the function given by /1^ 1 00 = sin -1 (x). 


Given a function / mapping X to T and a function g mapping Y to Z, we 
can compose the two to obtain a function mapping X to Z. Specifically: 

DEFINITION 0.26. Given functions f : X —► Y and g : Y —>■ Z, the 
composition of g with f is the function go f : X Z defined by g o fix) = 
gifix)). 


EXAMPLE 0.17. If / : R —> R is defined by fix) = 3x 2 and g : R -► R + 
is defined by gix) = 5 — 2x, then go/:R^ R + is the function defined by 

g o fix) = 5 — 2(3x 2 ) = 5 — 6x 2 . 


j 


If / : X -*■ Y is a bijective function and / _1 : Y -*■ X is its inverse, 
then by the definition of /~ 1 it follows that f~ ] o f(x) = x for all jc € X and 
f ° f~ l iy) — y for all y e Y. 

Intuitively we think of a set X as being finite if we can count the elements 
of X, starting with number 1, and ending at a positive integer value. Using 
bijective functions, we make this idea precise as follows: 

DEFINITION 0.27. A set X is finite if X is empty or if there is a bijection 
/ : {1.2,. ..,«}—>■ X for some n e Z+. A set that is not finite is said to be 
infinite. 

A special type of infinite set is the countably infinite set, a prototype of 
which is the set of positive integers Z+. 
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DEFINITION 0.28. A set X is countably infinite if there is a bijection 
f : Z+ X. A set that is either finite or countably infinite is said to be 
countable. A set that is not countable is said to be uncountable. 

The set of integers, Z, and the set of rational numbers, Q, are both countable 
sets. The set of real numbers, R, and the set of irrational numbers are both 
uncountable sets. 

The following theorem provides a few important facts involving subsets, 
unions, and products of finite and countable sets. They will be needed at 
different points within the text. 

THEOREM 0.29. 

(i) A subset of a finite set is a finite set. 

(ii) A finite union of finite sets is a finite set. 

(iii) A product of finite sets is a finite set. 

(iv) A subset of a countable set is a countable set. 

(v) A countable union of countable sets is a countable set. 

( vi ) A product of countable sets is a countable set. 


DEFINITION 0.30. Given a set X, a sequence in X is a function 

f:Z + ^X. 

The range of the function f is called the range of the sequence. 

Given a sequence / : Z+ —► X, we usually write x„ for f(n) and express 
the sequence in the form {x\,X 2 , ...) or (x„), thinking of the sequence as a 
countably infinite listing of elements of X with repetition allowed in the list. 

The range ofa sequence can be infinite, as it is with the sequence (1, 5 ,...), 

or it can be finite, as it is with the sequence ( 1 , 0 , 1 , 0 ,...). 

DEFINITION 0.31. Given sequences (x„) and (y n ), we say that (y„) is a 
subsequence of (x n ) if there is a subset of the positive integers that, when put 
in the order j\ < 72 < • • • < jn < • • •» is such that y„ — xj n for all n 6 Z+. 
The sequence of positive integers (j n ) is called an indexing sequence for the 
subsequence (y n ) = (x Jn ). 


- ( 

EXAMPLE 0.18. The sequence (2, 4, 6, 8,. ..) is a subsequence of the se¬ 
quence (1, 2, 3,4,...). On the other hand, the sequence (1, 1, 3,3, 5,5, ... ) 
is not a subsequence of (1,2, 3, 4,...). 




Chapter 1 


Topological Spaces 


Topological spaces and continuous functions between them are the primary 
objects of study in the field of topology. In this chapter, we introduce topolog¬ 
ical spaces and some important concepts associated with them, including open 
sets (Section 1.1), bases (Section 1.2), and closed sets (Section 1.3). Also, in 
Section 1.4, we present two applications of topological spaces—one involv¬ 
ing digital image processing, the other concerning evolutionary proximity in 
biology. 

1.1 Open Sets and the Definition of a Topology 

For many years, prior to the formalization of the field of topology, mathe¬ 
maticians used the concept of an open set, a simple example of which is an 
open interval on the real line. But over time it was realized that many of the 
properties held by open sets on the real line could be said to hold for certain 
types of subsets in any set. Eventually, the essential properties were distilled 
out and the concept of a collection of open sets, called a topology, evolved into 
the following definition: 

DEFINITION 1.1. Let X be a set. A topology T on X is a collection of subsets 
of X, each called an open set, such that 

(i) 0 and X are open sets; 

(ii) The intersection of finitely many open sets is an open set; 

(iii) The union of any collection of open sets is an open set. 

The set X together with a topology T on X is called a topological space. 

Thus a collection of subsets of a set X is a topology on X if it includes the 
empty set and X, and if finite intersections and arbitrary unions of sets in the 
collection are also in the collection. 

IMPORTANT NOTE: There are two things that make up a topological space: 
a set, X, and a collection, T, of subsets ofX that forms a topology on X. To 
be properly formal, we should refer to a topological space as an ordered pair 
(X, T), but to simplify notation we follow the common practice of referring to 
the set X as a topological space, leaving it implicitly understood that there is 
a topology on X. 

You might have an intuition about what open sets look like if you are 
familiar with open intervals in the real line or open disks in the plane. Although 
these are open sets in what are referred to as the standard topologies on the real 
line and the plane, it is important to avoid typecasting open sets at this point. 
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Any sets at all can be an open set if we pick the topology appropriately. Let us 
look at a few examples. 


EXAMPLE 1.1. Let X be the three-point set {a,b, c}. We consider four 
different collections of subsets of X in Figure 1.1 and will investigate which 
ones are topologies. In each case assume that the collection contains the empty 
set and each of the circled sets. 




3 



4 




a b c 


FIGURE 1.1: Which collections are topologies on X = {a, b , c)? 

Both 0 and X are in each of the four collections. We can see that for 
collections 1, 2, and 3, the intersection of sets in the collection is also in the 
collection, and the union of sets in the collection is also in the collection. 
Hence, collections 1, 2, and 3 depict topologies on X. However, in the case 
of collection 4, the sets {a} and {c} are each in the collection, but their union 
{a, c} is not. So collection 4 does not depict a topology on X. 


EXAMPLE 1.2. Let X be a nonempty set. Define T = {0, X}.- Notice that 
T satisfies all three of the conditions for being a topology. However, if we 
remove either set, we no longer have a topology. Thus {0, X} is the minimal 
topology we can define on X. For obvious reasons, it is called the trivial 
topology on X. 


EXAMPLE 1.3. Let X be a nonempty set and let T be the collection of all 
subsets of X. Clearly this is a topology, since unions and intersections of 
subsets of X are themselves subsets of X and therefore are in the collection T. 
We call this the discrete topology on X. This is the largest topology that we 
can define on X. 


EXAMPLE 1.4. On the real line, R, define a topology whose open sets are 
the empty set and every set in R with a finite complement. (See Figure 1.2.) 
For example, U = R — {0, 3,7} is an open set. We call this topology the finite 
complement topology on R and denote it by R/ c . 
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FIGURE 1.2: An open set in the finite complement topology on R. 

Let us check that My c is a topology: 

(i) The empty set is an open set. Since the complement of M is empty, 
and therefore a finite set, it follows that M is an open set. 

(ii) Are intersections of finitely many open sets open? Assume we 
have a finite collection, U\,... , U„, of open sets. If any one of 
them is empty, then the intersection is empty and therefore is an 
open set. Thus assume that each t/,- is nonempty. Each t/, has a 
finite complement, and therefore 

U\ — M — F\ , U% = M — /*2> • • • , U n — M — F n , 

where each F, is a finite set. Now, Ui = M — |J"_j F,. Since 
a finite union of finite sets is a finite set, it follows that Ui is a 
subset of M with a finite complement and therefore is an open set. 
Thus, the intersection of finitely many open sets is an open set. 

(iii) Are unions of open sets open? Let {U a } be a collection of open 
sets. Therefore each U a either is empty or has a finite complement. 
If every U a is empty, then |J U a is empty, and therefore is an open 
set. Thus assume that at least one U a is nonempty, say U a >. The set 
U a > has a finite complement. Note that U a i c [JU a , and it follows 
that (J U a has a finite complement since U a > does. Therefore [J U a 
is an open set. Thus, the union of arbitrarily many open sets is an 
open set. 


Given any nonempty set X we can similarly define the finite complement 
topology on X. We denote it by Xf C . 

We have now defined three different topologies on M: the trivial topology, 
the finite complement topology and the discrete topology. In the case of these 
three, the trivial topology, with fewest open sets, is contained within the finite 
complement topology, which is itself contained within the discrete topology. 
That is to say, every open set in the finite complement topology is also an open 
set in the discrete topology, but not vice versa. We say that the discrete topology 
is strictly finer than the finite complement topology. This terminology can be 
considered part of an analogy to gravel. The finer the gravel, the more ways 
there are to cement it together to obtain open sets. (See Figure 1.3.) 

More generally, we have the following definition: 

DEFINITION 1.2. Let X be a set and let T\ and T 2 be two topologies on X. 
IfT\ C Ti then T 2 is said to be finer than T\, and T\ is said to be coarser than 
Tj. Furthermore, ifTi is finer than T\ but not equal to T\ then To is said to be 
strictly finer than T\. Strictly coarser is defined similarly. 
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FIGURE 1.3: The finer the gravel, the greater the number of open sets that can be 
constructed from it. 

In general, two topologies on a given set X need not be comparable. Each 
may contain open sets that are not open sets in the other, and therefore neither 
topology would be finer than the other. (See Exercise 1.5.) 

To simplify discussions as we proceed, we use the following definition: 


DEFINITION 1.3. Let X be a topological space and x £ X. An open set U 
containing x is said to be a neighborhood ofx. 

The following theorem provides us with a particularly useful means for 
establishing whether or not a set is open. 


THEOREM 1.4. Let X be a topological space and let A be a subset of X. 
Then A is open in X if and only if for each x £ A, there is a neighborhood U 
ofx such that x £ U C A. (See Figure 1.4.) 



FIGURE 1.4: The set A is open in X if and only if every point in A has a neighborhood 
U that lies in A. 
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Proof. First suppose that A is open in X and x e A. If we let U = A 
then U is a neighborhood of x for which x e U c A. 

Now suppose that for every x e A there exists a neighborhood U x of 
x such that x € U x C A. By the Union Lemma (Lemma 0.5), it follows 
that A = \J x€ a U x . Thus, A is a union of open sets and therefore is 
open. ■ 

By definition, a set is open in a topological space if it is a member of the 
collection of sets that defines the topology. However, Theorem 1.4 provides 
us with an intuitive idea of what it means for a set to be open—specifically, a 
set is open if each point in the set has a neighborhood of points that lie in the 
set as well. 

Exercises for Section 1.1 

1.1. Determine all of the possible topologies on X = [a, b }. 

1.2. On the three-point set X = {a, b, c }, the trivial topology has two open sets and 
the discrete topology has eight open sets. For each of n = 3,... ,7, either find 
a topology on X consisting of n open sets or prove that no such topology exists. 

1.3. Prove that a topology T on X is the discrete topology if and only if {*) e T 
for all x e X. 

1.4. (a) Give an example of a space where the discrete topology is the same as the 

finite complement topology. 

(b) Make and prove a conjecture indicating for what class of sets the discrete 
and finite complement topologies coincide. 

1.5. Find three topologies on the five-point set X = {a,b,c, d, e) such that the first 
is finer than the second and the second finer than the third, without using either 
the trivial or the discrete topology. Find a topology on X that is not comparable 
to each of the first three that you found. 

1.6. Define a topology on R (by listing the open sets within it) that contains the open 
sets (0, 2) and (1, 3) and that contains as few open sets as possible. 

1.7. Let X be a set and assume p e X. Show that the collection T, consisting of 
0, X, and all subsets of X containing p, is a topology on X. This topology is 
called the particular point topology on X, and we denote it by PPX p . 

1.8. Let X be a set and assume p e X. Show that the collection T, consisting of 
0, X, and all subsets of X that exclude p, is a topology on X. This topology is 
called the excluded point topology on X, and we denote it by EPX p . 

1.9. Let T consist of 0, R, and all intervals (—oo, p) for p e R. Prove that T is a 
topology on R. 


1.2 Basis for a Topology 

In all of the preceding examples of topological spaces, we were able to specify 
the entire collection of open sets. In general this is difficult to do, so instead 
we specify a smaller collection of open sets, called a basis, and then generate 
the rest of the open sets from this collection. 
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DEFINITION 1.5. Let X be a set and B be a collection of subsets of X. We 
say B is a basis (for a topology on X) if the following statements hold: 

(i) For each x in X, there is a B in B such that x € B. 

(ii) If B\ and Z ?2 are in & and x € B\ fl B 2 , then there exists B 3 in B 
such that x € S 3 C B\ fl Bj. (See Figure 1.5.) 



FIGURE 1.5: For every point x in the intersection of two sets in the basis, there is 
a set in the basis containing x, contained in the intersection. 

We call the sets in B basis elements. Paraphrasing the two conditions for 
a basis, we have 

(i) Every point in X is contained in a basis element. 

(ii) Every point in the intersection of two basis elements is contained 
in a basis element contained in that intersection. 

Next we describe how a basis generates a topology. First, however we look 
at a couple of examples of bases. 


EXAMPLE 1.5. On the real line M, let B = {(a, b) C K | a < b), the set of 
open intervals in R, Certainly every point of E is contained in an open interval 
and therefore is contained in a set in B. Furthermore, if two open intervals 
intersect at all, they do so in an open interval, so a point in the intersection of 
two sets in B is contained in a set in B that is contained in the intersection. 
Thus B is a basis. 


EXAMPLE1.6. Let X be a set and let B = {{x} \x e X). Every* e X lies in 
the set {*} in B, so the first condition for a basis is satisfied by B. Furthermore, 
every pair of distinct sets in B is disjoint, so the second condition for a basis is 
satisfied automatically. Therefore B is a basis. 


What is the advantage of having a basis on a set XI It allows us to easily 
define a topology. 
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DEFINITION 1. 6 . Let B be a basis on a set X. The topology T generated by 
B is obtained by defining the open sets to be the empty set and every set that is 
equal to a union of basis elements. 

We need to check that the resulting collection T is actually a topology; we 
do that after looking at a couple of examples. 


EXAMPLE 1.7. Let X be a nonempty set and B — {{x} | x € X}. In 
Example 1.6 we showed that B is a basis for a topology on X. What is the 
topology T that B generates? Notice that every subset U of X is the union of 
the single-point subsets corresponding to its elements. Therefore every subset 
of X is an open set in T, implying that B generates the discrete topology on X. 


EXAMPLE 1.8. On the real line R, let B = {( a,b) C R | a < b). In 
Example 1.5 we showed that B is a basis for a topology on R. The topology 
generated by B is called the standard topology on R and is the topology most 
commonly used on the real line. Open sets in the standard topology on R are 
unions of open intervals. When we refer to R as a topological space, unless 
otherwise specified, it will always be assumed that the topology is the standard 
topology. 


Next we show that the topology generated by a basis actually is a topology. 
To begin we need the following lemma. 

LEMMA 1.7. Let B be a basis. Assume that B\,.... B n € B and that x e 
n?=i Bi- Then there exists B' e B such that x e B' c p|”_i #<■ 

Proof. We prove this by induction on n, starting with n = 2. The n = 2 
case holds by the second condition in the definition of a basis. 

Assume that the result is true for n — 1. Suppose that the sets 
Si, ... , B„ are in B and that x e p)" =1 S,-. Then x e P)"=/ #/> and 
the induction hypothesis implies that there exists B* e B such that x e 
B* c niZl Bi. Now x e B* n B„; therefore by the second condition in 
the definition of a basis, there exists S' e B such that x e S' c S*flB„. 
Since S* C p)"”/ &i follows that x e S' C fl/=i &i’ Thus, if the 
result holds for n — 1, then it holds for n, and by induction the lemma 
follows. ■ 

THEOREM 1.8. The topology T generated by a basis B is a topology. 

Proof. The empty set 0 is in T by definition. Since every point in X is 
contained in some basis element, it follows that X is the union of all of 
the basis elements and therefore is in T. 

Next we show that a finite intersection of sets in T is in T. Thus, let 
V = U\ fl... fl U n where each U-, is in T. If any one of the {/,• is empty, 
then so is V, and in this case V is in T. Thus assume that each £/,• is a 
union of basis elements. We show that V is a union of basis elements 
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as well. Let x e V be arbitrary. Then x e £/,■ for all i. Since each {/; 
is a union of basis elements, there exists a basis element B,- such that 
x e Bj c Uj for each i. Then x e P )" =1 B £ ; therefore, by Lemma 1.7, 
there exists a basis element B.* such that x e B x C fj;=i Si C V. It 
follows from the Union Lemma (Lemma 0.5) that V = U.*ev 5*. and 
therefore V is a union of basis elements. Thus, a finite intersection of 
sets in T is in T. 

Finally, we show that an arbitrary union of sets in T is in T. Let 
V = U U a where each U a is either the empty set or a union of basis 
elements. If each U a is empty, then so is V; on the other hand, if at 
least one U a is nonempty, then V is a union of basis elements, since it is 
the union of all of the basis elements making up the U a ' s. Therefore an 
arbitrary union of sets in T is in T. 

Thus, the collection of sets T is a topology, and we are justified in 
calling it the topology generated by the basis B. D 

IMPORTANT NOTE: Not only does a basis generate a topology, but each 
basis element is also itself an open set in the topology generated by the basis. 


EXAMPLE 1.9. On R, let B = {[a, b) c R | a < b). The collection B is a 
basis for a topology on M. (See Exercise 1.10.) We call the topology generated 
by this basis the lower limit topology since each basis element contains its 
lower limit. We denote M with this topology by R/. 

The intervals [0, 2) and (0, 2) are both open in R/. The former is open 
since it is a basis element; the latter is open since it is the union of the basis 
elements B, = [}, 2), where i = 1, 2, 3,_ 


We can similarly define the upper limit topology on R via the basis B = 
{(a, b] C R | a < b). 

We have now introduced six topologies on the real line: the standard topol¬ 
ogy, the upper limit topology, the lower limit topology, the finite complement 
topology, the discrete topology, and the trivial topology. Of course, the triv¬ 
ial topology is the coarsest of these topologies, and the discrete topology is 
the finest, but how do the rest of these topologies compare? We ask you to 
investigate this question in Exercise 1.13. 

On the set of integers, Z, the standard topology turns out to be the discrete 
topology. (We will see why in Section 3.1.) There are, however, nondiscrete 
topologies on Z that play a role in topology and its applications. In the next 
example we introduce a topology on Z that is useful in digital image processing. 
We discuss this application further in Sections 1.4 and 11.3. 


EXAMPLE 1.10. For each n e Z, define 

if n is odd > 

^ ’ {n — 1, n, n + 1 } if n is even. 

We illustrate these sets in Figure 1.6. The collection B = {B(n) | n € Z} is a 
basis for a topology on Z. (See Exercise 1.14.) The resulting topology is called 
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the digital line topology, and we refer to Z with this topology as the digital 
line. 



-7 -6 -5 -4 -3 -2 -1 0 1 2 3 4 5 6 7 

FIGURE 1.6: Basis elements for the digital line topology. 


j 


By definition, the open sets in the topology generated by a basis are the 
sets obtained by taking unions of basis elements. The next theorem provides 
another simple way to describe the sets in such a topology. 

THEOREM 1.9. Let X be a set and B be a basis for a topology on X. Then 
U is open in the topology generated by B if and only if for each x € U there 
exists a basis element B x € B such that x 6 B x C U. (See Figure 1.7.) 



FIGURE 1.7: The set U is open if and only if every x in U is contained in a basis 
element contained in U. 

Proof. Suppose that U is an open set in the topology generated by B 
and that x € U. Since U is a union of basis elements, there is at least 
one basis element B' making up this union that contains x. Clearly then 
x eB' CU. 

Now suppose that U C X is such that for each x e U there exists 
B x € B such that x e B x C U. By the Union Lemma (Lemma 0.5), 
U = U KeU B x , and therefore U is a union of basis elements. Thus, U 
is an open set in the topology generated by B. ■ 


EXAMPLE 1.11. We have examined a few topologies on the real line. Next, 
let us look at the plane R 2 . For p = (p\, pz) and q = (q\, < 72 ), two points in 
R 2 , we introduced the Euclidean distance formula 

d(p, q) = yj(pi - q \) 2 + (P2 ~ qi ) 2 
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in Section 0.4. For each x in R 2 and s > 0, define 

B(x, s) = [p € R 2 | d(x, p) < £}. 

The set B(x,s) is called the open ball of radius s centered at x. Let 

B = [B{x, s) | x € R 2 , s > 0}. 

So B is the collection of all open balls associated with the Euclidean distance 
d. In Theorem 1.10 we show that B is a basis for a topology on R 2 . We call the 
topology generated by B the standard topology on R 2 . It is the most common 
topology used on R 2 . 

THEOREM 1.10. The collection B = {£(*, e) \ x e R 2 , e > 0} is a basis 
for a topology on R 2 . 

Before proceeding with the proof of this theorem, we prove the following 
supporting lemma: 

LEMMA 1.11. Let y be in R 2 and assume r > 0. Then for every x 6 B(y, r) 
there exists an e > 0 such that B(x, s ) c B(y, r ). 

The lemma indicates that if a point x is in some open ball B in IK 2 , then 
there is an open ball centered at x contained in B as well. (See Figure 1. 8 .) 
We will use the lemma in establishing that the collection of open balls in R 2 
satisfies the second property of a basis. 



FIGURE 1.8: Every point ;t in B(y, r) is the center of some open ball contained in 

Proof of Lemma 1.11. Suppose* € B(y, r). Then J(x, y) < r. Choose 
e such that 0 < e < r — d(x, y). We claim that B(x, e ) C B(y, r). Sup¬ 
pose z € B(x, e). Then we have 

d(y,z ) < d{y,x) +d(x,z) 

< d(y,x) + s 

< d(y,x) + r - d(x, y) 

= r. 

Therefore z 6 B(y, r), and it follows that B(x, s) C B(y, r), completing 
the proof of the lemma. ■ 

Proof of Theorem 1.10. Since each i e R 2 is contained in the basis 
element B(x, 1), the first condition for a basis is satisfied. 
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Now we need to verify that, if x is in the intersection of two basis 
elements, there is then a basis element containing x and contained in the 
intersection. Suppose x e B(p, n) D B(q, r 2 ). By Lemma 1.11 there 
exist £i, £2 > 0 such that B(x, £j) C B(p, n) and B(x, £ 2 ) C B(q, r 2 ). 
Let £ = min{£i, £ 2 }. Then 

B(x, £) C B(x, £ 1 ) n B(x, £ 2 ) C B(p, n) n B(q, r 2 ), 
and it follows that B satisfies the second condition for a basis. 

Therefore B is a basis for a topology on R 2 . ■ 

In the topology on R 2 generated by B the open sets are sets that can be 
expressed as a union of open balls. For example, open balls, open rectangles, 
and open half planes, as illustrated in Figure 1.9, are all open sets. (See 
Exercise 1.17.) 



FIGURE 1.9: Open sets in the standard topology on R 2 . 

We can similarly define the standard topology on R" using the Euclidean 
distance formula 

d(p, q) = 1 / (P\ ~ q \) 2 + (P 2 ~ qi ) 2 + ■ ■ ■ + (Pn ~ qf) 2 - 
We define an open ball to be a set of the form 

B(x, £) = [p e R" | d(x, p) < £}, 

and we define a basis B for the topology to be the collection of all open balls 
in R" given by B = {B(x, s) \ x e R", £ > 0}. 

Not only is the standard topology on R 2 generated by the collection of 
open balls in R 2 , but, as the following theorem indicates, it is also generated 
by the collection of open rectangles in R 2 , where an open rectangle is a set of 
the form (a, b) x (c, d). 

THEOREM 1.12. On the plane R 2 , let 

B = [(a, b) x (c, d) C R 2 | a < b, c < d}. 

Then B is a basis, and the topology T' generated by B is the standard topology 
on R 2 . 


Proof. See Exercise 1.16. 
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Theorem 1.12 demonstrates that a given topology can be generated by more 
than one basis. Furthermore, it demonstrates that the geometric shape of the 
individual basis elements is not significant in determining the open sets in the 
topology. To be specific, the topology on the plane generated by the collection 
of open balls is the same as that generated by the collection of open rectangles. 
We can also generate that same topology by a collection of diamonds or even 
hearts, as illustrated in Figure 1.10. 



FIGURE 1.10: Many different bases can generate the same topology. 

In a fashion similar to that used in Theorem 1.12, it is not difficult to show 
that the standard topology on W 1 is also generated by a basis consisting of open 
rectangles (a\, b\) x ... x (a„, b„). 

The following theorem provides us with a quick way of proving that a 
collection of open sets in a topology is a basis for the topology. We will find 
this theorem useful when we introduce some new constructions for topologies 
in Chapter 3. 

THEOREM 1.13. Let X be a set with topology T, and let C be a collection 
of open sets in X. If, for each open set U in X and for each x e U, there is 
an open set V in C such that x e V C U, then C is a basis that generates the 
topology T. 

Proof. First, we check that C is a basis. Let x be in X. Since X is itself 
an open set, there is an open set V in C such that x e V C X. Therefore 
every point in X is contained in some open set V in the collection C. 
Thus the first condition for a basis is satisfied. 

Suppose now that x is in the intersection of two of the open sets in 
C, V\ and V 2 - Then x e V\ fl V 2 , which is itself an open set. According 
to our hypothesis, there must be an open set V 3 in C such that x e V 3 C 
V\ n V 2 . Therefore C is a basis. 

Now, we need to check that T, the topology generated by C, co¬ 
incides with T. Suppose U is open in T. Then by the hypothesis, for 
every x € U there is an open set V in C such that x e V C U. So by 
Theorem 1.9, U is open in T', the topology generated by C. Therefore 
T c T 7 . Now suppose that W is an open set in T'. Then W is a union of 
open sets in C, all of which are open in T. But every union of open sets 
in T is open in T, so W is also open in T. Thus T cT and it follows 
that T and T coincide. ■ 
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EXAMPLE 1.12. We can immediately see from Theorem 1.13 that the open 
hearts in Figure 1.10 form a basis for the standard topology on R 2 . First, note 
that the hearts themselves are open sets in the standard topology. Now, suppose 
that x is in an open set U . Then there is an open ball B containing x, contained 
in U . If we choose a heart H containing x that is small enough to fall entirely 
in B, then we have x e H c U. Therefore, by Theorem 1.13, the collection 
of hearts is a basis for the standard topology on R 2 . 


Exercises for Section 1.2 

1.10. Show that B — {[a, b) c R | a < b] is a basis for a topology on R. 

1.11. Determine which of the following collections of subsets of R are bases: 

(a) C\ = {(n, n + 2) c R | n € ZJ 

(b) C 2 = {[a, b] C R | a < b] 

(c) C 3 = {[a, b] C R | a < b] 

(d) C 4 = {(—jc, jc) c R | x e R} 

(e) C 5 = {{a, b) U [b + 1} c R | a < b) 

1.12. Determine which of the following are open sets in R/. In each case, prove your 
assertion. 

A = [4, 5) B = {3} C = [1,2] £> = (7,8) 

1.13. Consider the following six topologies defined on R: the trivial topology, the 
discrete topology, the finite complement topology, the standard topology, the 
lower limit topology, and the upper limit topology. Show how they compare 
to each other (finer, strictly finer, coarser, strictly coarser, noncomparable) and 
justify your claim. 

1.14. Let B be the collection of subsets of Z used in defining the digital line in 
Example 1.10. Show that B is a basis for a topology on Z. 

1.15. An arithmetic progression in Z is a set 

A a ,b = {• • • , a — 2 b, a — b, a, a + b, a + 2 b ,...} 
with a, b € Z and b ^ 0. Prove that the collection of arithmetic progressions 
A = {A a ,fc | a, b e Z and b / 0} 

is a basis for a topology on Z. The resulting topology is called the arithmetic 
progression topology on Z. 

1.16. Prove Theorem 1.12: On the plane R 2 , let 

B = {(a, b) x (c, d) C R 2 | a < b, c < d}. 

(a) Show that B is a basis for a topology on R 2 . 

(b) Show that the topology, T', generated by B is the standard topology on 
R 2 . (Hint: if T is the standard topology, show that TcT' and T' c T.) 

1.17. An open half plane is a subset of R 2 in the form {(x,y) e R 2 | Ax + By < C] 
for some A, B, C € R with either A or £ nonzero. (See Figure 1.11.) Prove 
that open half planes are open sets in the standard topology on R 2 . 
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FIGURE 1.11: An open half plane in R 2 . 

1.18. Show that the collection {(—oo, q) C R | q rational} is a basis for the topology 
in Exercise 1.9. 

1.19. (a) Show that the collection {{a} x ( b , c) c R 2 | a, b, c e R) of vertical 

intervals in the plane is a basis for a topology on R 2 . We call this topology 

the vertical interval topology. 

(b) Compare the vertical interval topology with the standard topology on R 2 . 


Supplementary Exercises: Creating a Topology from a Collection of Subsets 

Throughout these exercises, let X be a set and let C be a collection of subsets of X 
whose union equals X. Of course, in general C is not a topology. However, we would 
like to be able to construct a topology so that the subsets in C are open sets in the 
topology. We could just take the discrete topology, where all subsets of X are open. 
However, we prefer to construct a topology containing C with as few extra open sets 
as possible, a “smallest” topology containing C. 

Let Be be the collection of all subsets of X that can be expressed as an intersection of 
finitely many of the sets from C. 

SE 1.20. Show that Be is a basis for a topology on X. 

Let Tc be the topology generated by the basis Be- The topology Tc consists of arbitrary 
unions of finite intersections of sets in C . The collection C is what is known as a subbasis 
for the topology Tc- 

SE 1.21. Prove that every set in C is an open set in the topology Tc; that is, prove that 
CcT c . 

Now we verify that Tc is the smallest topology containing C. 

SE 1.22. Let T be a topology on X containing C. Thus assume that every set in C is 
an open set in T. Prove that Tc C T. 

For the remaining exercises assume that each x e X is contained in at most finitely 
many sets in C. (For example, this occurs if X is a finite set.) For each x e X, let B x 
be the intersection of the sets in C containing x. 

SE 1.23. Show that the collection B' c = [B x } x€ x is a basis for Tc- 
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In general, a topological space does not have a minimal basis, that is, a basis containing 
as few basis elements as possible. However, in the setting we are considering here, 
where each x s X is contained in finitely many of the sets in C, we can show B' c is the 
minimal basis for Tz as follows: 

SE 1.24, Prove that if B is a basis for Tc, then B' c c B. 


1.3 Closed Sets 

So far, we have been talking exclusively about open sets. Now it is time to 
look at the complementary concept and introduce closed sets. Their definition 
is quite simple. 

DEFINITION 1.14. A subset A of a topological space X is closed if the set 
X — A is open. 


EXAMPLE 1.13. Consider R with the standard topology. Then, as illus¬ 
trated in Figure 1.12, since (0, 1) is open, (—oo, 0] U [1, oo) is closed; since 
(—oo, a) U ( b , oo) is open, [a, b\ is closed; and, since (—oo, c) U (c, oo) is 
open, {c} is closed. 
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FIGURE 1.12: Closed sets in R with the standard topology. 


DEFINITION 1.15. 

(i) For each x in R 2 and e > 0, define the closed ball of radius e 
centered at x to be the set 

B(x, e) = {y e R 2 | d(x, y) < a}, 
where d(x,y) is the Euclidean distance between x and y. 

(ii) If [a, b] and [c, d] are closed bounded intervals in R, then the 
product [ a,b ] x [c, d] c R 2 is called a closed rectangle. 

THEOREM 1.16. Closed balls and closed rectangles are closed sets in the 
standard topology on R 2 . 

Proof. We ask you to prove that closed balls are closed sets in Exer¬ 
cise 1.26. Here, we prove that closed rectangles are closed sets in the 
standard topology on R 2 . 
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Let A = [a, b] x [c, d] be a closed rectangle in R 2 . To show that A is 
a closed set in the standard topology, we prove that R 2 - A is an open set. 
Note that R 2 — A can be expressed as the union of four open half-planes: 
{(x, y) | x < a}, {(x, y) \ x > b], {(x, y) \ y < c }, and {(x, y) | y > d}. 
Since each of these half-planes is an open set (see Exercise 1.17), and a 
union of open sets is an open set, it follows that R 2 — A is an open set. 
Hence, the rectangle A is a closed set. ■ 


EXAMPLE 1.14. Consider R 2 with the standard topology. In Figure 1.13 the 
top three sets are closed, and the bottom three are not. 



FIGURE 1.13: In R 2 with the standard topology, the top three sets are closed, but 
the bottom three are not. 

The spiral on the lower left is the graph of the polar-coordinates equation 
r = with 0 > 0. It winds out toward the circle S 1 , but it contains no point 
from S . The spiral is not closed because the point (1, 0) is in the complement, 
but no open ball containing ( 1 , 0 ) is a subset of the complement since every 
such open ball intersects the spiral. Thus the complement is not open, implying 
that the spiral is not closed. 


If a set C is closed, then by definition its complement is open. What can 
we say about the complement of an open set? Let U be an open set, and let 
K = X — U be its complement. Consider the complement of K. We have 
X — K = X — (X — U) = U, and U is an open set. Thus K, the complement 
of U . is closed. Therefore the complement of an open set is closed. 

In summary, the complement of a closed set is open by the definition of 
closed set, and the complement of an open set is closed by the argument just 
presented. (See Figure 1.14.) 


EXAMPLE 1.15. Let X be a set with the discrete topology. Every subset of 
X is an open set. Given an arbitrary set A in X, its complement is then open 
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since every set is open. Therefore A is closed and, since A was arbitrary, it 
follows that every subset of X is also a closed set in the discrete topology. 

_i 



FIGURE 1.14: If C is closed, its complement is open; if U is open, its complement 
is closed. 


EXAMPLE 1.16. Let the set {a, b, c, d } have the topology T as shown in 
Figure 1.15. Note that {6} is open and not closed, {a} is closed and not open, 
{a, b } is both open and closed, and [b, c } is neither open nor closed. 



FIGURE 1.15: A topology with open sets, closed sets, sets both open and closed, 
and sets neither open nor closed. 


j 


IMPORTANT NOTE: A common error is to mistake closed for not open, but 
as this last example illustrates, sets can be any combination of open and closed. 

Here is a riddle to help remember this: 

Question: How is a subset different from a door? 

Answer: A door must be open or closed. But a subset can be either, both, or 
neither. 


It is hard to resist math humor, but in the future, we will try. 










Section 1.3 Closed Sets 59 


Since closed sets are complementary to open sets, their properties are 
similar, but there are some fundamental differences. 

THEOREM 1.17. Let X be a topological space. The following statements 
about the collection of closed sets in X hold: 

(i) 0 and X are closed. 

(ii) The intersection of any collection of closed sets is a closed set. 

(iii) The union of finitely many closed sets is a closed set. 

Proof. See Exercise 1.33. ■ 

Thus, the collection of closed sets in a topological space X includes the 
empty set and X and is such that finite unions and arbitrary intersections of 
closed sets are closed sets. 

All single-point sets are closed in R" with the standard topology. However, 
there are topologies where this is not the case. For example, consider the 
topology T* on ]R generated by the basis of sets {[«, n + 1) cM|n e Z}. It 
is not difficult to see that no single-point set is closed in T*. 

Next we present a property of topological spaces that has some particularly 
nice consequences for spaces satisfying it. As we show below, these conse¬ 
quences include having single-point sets closed. We will see a few of the other 
convenient consequences of this property throughout the text. 

DEFINITION 1.18. A topological space X is Hausdorff if for every pair of 
distinct points x and y in X, there exist disjoint neighborhoods U and V of x 
and y, respectively. (See Figure 1.16.) 



FIGURE 1.16: In a Hausdorff space distinct points have disjoint neighborhoods. 

In his 1914 book, Grundzuge der Mengenlehre, Felix Hausdorff presented 
the first general axiom system for what was to become known as a topological 
space. His axioms essentially describe what we refer to as a basis for a topology. 
His system also included an axiom that is no longer contained in the axioms for 
a general topological space, but that now defines the special class of topological 
spaces that are known—appropriately enough—as Hausdorff spaces. 
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The Hausdorff property is easy to remember: In a Hausdorff space, each 
point in a pair of points can be “housed off” from the other by disjoint neigh¬ 
borhoods. 


EXAMPLE 1.17. The real line R with the standard topology is Hausdorff. 
Given two distinct points a and b, there are disjoint open intervals containing 
them. For example, if a < b, then the intervals 

!/ = (._ 1 ,^) andv = (^, 6+1 ) 
are disjoint and contain a and b, respectively. 


EXAMPLE 1.18. Every set X with the discrete topology is Hausdorff. If x 
and y are distinct points, then the sets {jc } and {y} are disjoint neighborhoods 
of x and y, respectively. 


As we have already indicated, Hausdorff spaces have a variety of useful 
properties, such as: 

THEOREM 1.19. If X is a Hausdorff space, then every single-point subset 
of X is closed. 

Proof. Suppose x € X. In order to show that {x} is closed, we show that 
X — {jc } is open. Let y e X — {*} be arbitrary. Since X is Hausdorff, there 
are disjoint neighborhoods U and V containing x and y, respectively. It 
follows that x V and therefore y € V c X — {x}. Since every 
y € X — {x} is in an open set contained in X — {x}. Theorem 1.4 implies 
that X — {x} is open. ■ 

Exercises for Section 1.3 

1.25. Prove that, in a topological space X, if U is open and C is closed, then U — C 
is open and C — U is closed. 

1.26. Prove that closed balls are closed sets in the standard topology on R 2 . 

1.27. The infinite comb C is the subset of the plane illustrated in Figure 1.17 and 
defined by 

C = {(a:, 0) | 0 < x < 1} U {(^r, y) \ n = 0,1, 2,... and 0 < y < 1}. 


(a) Prove that C is not closed in the standard topology on R 2 . 

(b) Prove that C is closed in the vertical interval topology on R 2 . (See Exer¬ 
cise 1.19.) 
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FIGURE 1.17: The infinite comb. 

1.28. Which sets are closed sets in the finite complement topology on a topological 
space X? 

1.29. Which sets are closed sets in the excluded point topology EPX p on a set X? 
(See Exercise 1.8.) 

1.30. Which sets are closed sets in the particular point topology PPX p on a set X? 
(See Exercise 1.7.) 

1.31. Show that a single-point set {n} is closed in the digital line topology if and only 
if n is even. 

1.32. Prove that intervals of the form fa, b) are closed in the lower limit topology 
on R. 

1.33. Prove Theorem 1.17: Let X be a topological space. 

(a) Prove that 0 and X are closed sets. 

(b) Prove that the intersection of any collection of closed sets in X is a 
closed set. 

(c) Prove that the union of finitely many closed sets in X is a closed set. 

1.34. On the five-point set {a, b, c, d, e] construct two topologies, one that is Haus- 
dorff (other than the discrete topology) and one that is not Hausdorff (other than 
the trivial topology). 

1.35. Show that K in the lower limit topology is Hausdorff. 

1.36. Show that R in the finite complement topology is not Hausdorff. 

1.37. Prove that the arithmetic progression topology on Z, introduced in Exercise 1.15, 
is Hausdorff. 


Examples of Topologies in Applications 

In this section we discuss two applications of topological spaces: one showing 
how topological spaces model digital image displays in digital topology, and 
the other showing how topological spaces are employed to model evolutionary 
proximity in molecular biology. 
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Digital Topology 

Digital topology is the study of topological relationships on a digital image 
display (for example, a computer screen). It plays a role in the field of digital 
image processing. The digital image display contains a rectangular pixel array, 
as illustrated in Figure 1.18. In digital topology, this array is modeled by what 
is known as the digital plane. We provide a specific definition hereafter. 
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FIGURE 1.18: The digital image display is a rectangular array of pixels. 


An important task in digital image processing is to determine features of an 
object from a digital image of it. For example, an optical character-recognition 
program reads a digital image of a character (such as in Figure 1.19) and at¬ 
tempts to determine the character represented by the image so that subsequently 
it can be used within a word-processing program. Topological properties of 
the image assist in determining the intended character. The fact that the image 
in Figure 1.19 encloses two regions helps distinguish the associated character 
from characters that do not. 
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FIGURE 1.19: A digital image of the letter B. 

In the initial development of the field of digital topology, work focused 
on defining and studying digital analogs of topological concepts (for example, 
connectedness and continuity) without having an underlying topology on the 
model of the digital image display. The 1979 paper, “Digital Topology,” by 
Azriel Rosenfeld (1931-2004), was one of the first introductions to these ideas. 
Later, topological spaces were found that appropriately model the digital image 
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display, enabling topological concepts to be used directly in the digital domain. 
(See [Kon], for example.) 

In this section we introduce the digital line and digital plane models and 
consider some basic topological properties associated with them. In Sec¬ 
tion 11.3 we examine an application of digital topology to digital image pro¬ 
cessing. 

The digital line, introduced in Example 1.10, models a one-dimensional 
digital image display. We assume that we have an infinite line of pixels, each 
of which corresponds to an odd integer. For each odd integer we have a basis 
element B(n ) = {«}. Consequently, each individual pixel is an open set in 
the digital line topology. In the digital line we also have structure that reflects 
the adjacency shared by consecutive pixels. We regard each even integer n as 
representing the boundary between the pixels at rt — 1 and n + 1. Associated 
to each even n we have a basis element B(n) = {n — 1, n, rt + 1}. 

In the digital line, every odd integer is an open set, and every even integer is 
a closed set (see Exercise 1.31). We view the digital line as a set of open pixels 
corresponding to the odd integers, along with the set of closed boundaries 
between the pixels corresponding to the even integers. (See Figure 1.20.) 



FIGURE 1.20: The digital line models the line of pixels and boundaries between 
them. 

The digital line is not Hausdorff since there is no pair of disjoint open sets 
U, V with n e U and n + 1 € V. 

Next, we introduce the digital plane. As with the digital line, the idea is 
to have an array of single-point open sets model the pixels in the digital image 
display and also to have additional points representing the boundaries between 
the pixels. We start with Z x Z, the product of two copies of the integers. For 
each point ( m , n) we have a basis element B(m, n) defined as follows: 

{(/n, n)} if m and n are odd, 

r>( \ _ {(m +a,ri) | a = —1,0,1} if m is even and n is odd, 

bn, n) — ^ m n + b ) | b = -1,0,1) if m is odd and n is even, 

. {(m +a,n + b) \ a,b — —1, 0 ,1 } if m and n are even. 

The basis elements B(m , n) = {{m, n)}, where m and n are both odd, are 

the single-point open sets representing the pixels in the digital image display. In 
Figure 1.21 we show some of the basis elements for the digital plane topology. 

The collection Bp = {B(m, n) \ (m, n) eZxZjisa basis for a topology 
on Z x Z. (See Exercise 1.38.) The resulting topological space is called the 
digital plane. In this topology, the single-point sets {( m , n)} are closed sets if 


64 Chapter 1 Topological Spaces 


0 


0-0 


-»IQ • 0 


0 


0 




O-G 




0-0 


r"\ 


-5 -4 -3 -2 -1 0 1 2 3 4 5 6 7 


FIGURE 1.21: Some basis elements for the digital plane. 

m and n are both even. (See Exercise 1.39.) Furthermore, like the digital line, 
the digital plane is not a Hausdorff space. 

As illustrated in Figure 1.22, we view the digital plane as a set of 

(i) Open pixels corresponding to the points (m, n) where m and n are 
both odd, 

(ii) Vertical and horizontal boundaries between the pixels correspond¬ 
ing to the points ( m , n) where either m or n is even and the other 
is odd, 

(iii) Closed comer boundary points corresponding to the points ( m,n) 
where both m and n are even. 



FIGURE 1.22: The digital plane models a rectangular array of pixels and the bound¬ 
aries between them. 

We explore the digital plane further in Section 11.3. 

Phenotype Spaces 

The genotype-phenotype relationship is of fundamental importance in biol¬ 
ogy. The genotype is internally coded, inheritable information possessed by 
all living organisms, while the phenotype is the physical realization of that 
information. For example, the collection of genes responsible for eye color 
in a particular individual is a genotype. The observable eye coloration in 





















Section 1.4 Examples of Topologies in Applications 65 


the individual is the corresponding phenotype. In this section, we examine a 
model of evolutionary proximity established by defining a topology on a set of 
phenotypes. Molecular biologists propose this model as a means for formally 
defining continuous and discontinuous evolutionary change, providing a math¬ 
ematical framework for understanding evolutionary processes. (See [Fonl], 
for example.) 

We demonstrate the idea of a phenotype space via an example involv¬ 
ing ribonucleic acid (RNA) molecules. Strands of RNA are formed from 
smaller molecules, called nucleotides, that bond together to make strong, flex¬ 
ible chains. There are four different nucleotides in RNA: guanine (G), cytosine 
(C), adenine (A) and uracil (U). Nonadjacent nucleotide pairs undergo addi¬ 
tional (weaker) bonding, contorting the chain into a more complicated folded 
arrangement, as illustrated in Figure 1.23. 

GUGAUGGAUU AGGAUGUCCU ACUCCUUUGC UCCGUAAGAU AGUGCGGAGU UCCGAACUUA CACGGCGCGC GGUUAC 



FIGURE 1 23: RNA is a folded and bonded chain of nucleotides. 

Pairings of guanine with cytosine and adenine with uracil contribute the 
most to folding, though guanine and uracil can also pair. In theory, there are 
many ways that a particular nucleotide chain can fold and bond, but only the 
most energetically favorable of these are likely to occur. For simplicity, we 
assume that this bonding occurs in a unique way. 

Here, we consider only relatively short nucleotide chains to illustrate the 
structures and the topological model. RNA molecules in living cells can be 
anywhere from tens to thousands of nucleotides in length. 

For a particular RNA molecule, the associated unfolded nucleotide chain is 
called the primary structure. We represent the nucleotide chain by a sequence 
of C’s, G’s, A’s and U’s, called the genotype sequence for the molecule. 

Given a genotype sequence, its bonding diagram is an unlabeled diagram 
depicting the bonding that occurs in the resulting RNA molecule. In Figure 1.24 
we illustrate the bonding diagram for the RNA molecule in Figure 1.23. The 
dot on the diagram refers to the location of the first nucleotide in the sequence, 
and the rest of the sequence wraps counterclockwise around the perimeter of 
the diagram. The bonding diagram is the phenotype in our model; it is also 
referred to as the RNA shape or secondary structure for the RNA molecule. 
The phenotype spaces we introduce are sets of RNA shapes on which a topology 
is defined. 
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EXAMPLE 1.19. Consider the following three genotype sequences: 

1. GGGCAGUCUC CCGGCGUUUA AGGGAUCCUG AACUUCGUCG 

CUCCCAUCCA AUCAGUCCGC CUCACGGAUG GAGUUG 

2. GGGCAGUCUC CCGGCGUUUA AGGAAUCCUG AACUUCGUCG 

CUCCCAUCCA AUCAGUCCGC CUCACGGAUG GAGUUG 

3. GGGCAGUCUC CCGGCCUUUA AGGGAUCCUG AACUUCGUCG 

CUCCCAUCCA AUCAGUCCGC CUCACGGAUG GAGUUG 

The bonding diagrams for these three genotype sequences are shown in Fig¬ 
ure 1.25. 



FIGURE 1.25: Bonding diagrams for the nucleotide sequences. 

Sequences 1 and 3 are identical except for the 16th entry, and the bonding 
diagram is the same for both. On the other hand, sequence 2 differs from 
sequence 1 in only one entry, the 24th, but the corresponding bonding diagrams 
are very different. 


In our model there is a unique RNA shape associated to each genotype se¬ 
quence. This relationship is not one-to-one since multiple genotype sequences 
can result in the same RNA shape, as we see in the previous example. An 
important aspect of this theory is the notion that some single-entry changes 
in the genotype sequence completely alter the bonding diagram while other 
changes do not. 
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DEFINITION 1.20. The set of all genotype sequences that result in a par¬ 
ticular RNA shape s is called the neutral network of s and is denoted N(s). 


EXAMPLE 1.20. Consider the genotype sequences of length 10 made up of 
guanine (G) and cytosine (C) only. There are 2 10 = 1024 possible genotype 
sequences, and, upon folding and bonding, they result in eight different RNA 
shapes. The RNA shapes are illustrated in Figure 1.26, along with the number 
of different genotype sequences associated to each (i.e., the size of N(s) for 
each 5 ). We denote this set of RNA shapes by GCiq. 
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FIGURE 1.26: The set GC 10 . 


In a general biological setting genetic mutation is a random process by 
which the genotype changes. As a result of the many-to-one nature of the 
genotype-phenotype mapping, many genetic mutations do not alter the result¬ 
ing phenotype. On the other hand, there are occasional mutations that result in 
a new phenotype, and then the process of natural selection determines whether 
or not the new phenotype persists. 

Now, suppose that we have a specific genotype sequence, and let r be 
its corresponding RNA shape. In this setting, a genetic mutation is a random 
change in the entries in the genotype sequence. As these random changes occur, 
we may remain within the neutral network of r, “drifting” through genotype 
sequences that all result in RNA shape r. At some point, however, a change 
in one entry in a genotype sequence might take us out of the neutral network 
of r into that of another RNA shape s. We say that r has mutated to s. (See 
Figure 1.27.) 

Given RNA shapes r and s, we would like to know how likely it is for r to 
mutate to 5 as a result of a single-entry change in a genotype sequence in the 
neutral network of r. We can define and quantify a probability to make this 
precise. We will use this probability to define phenotype-space topologies on 
sets of RNA shapes. 

Before defining and using the probability, however, we introduce several 
quantities that play a role in its definition. By a point mutation we mean 
a mutation from one genotype sequence to another obtained by changing a 
single entry in the sequence. In Example 1.19 sequence 2 is obtained by a 
point mutation from sequence 1, and vice versa (similarly for sequences 1 
and 3). 
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FIGURE 1 27: Genetic mutation from secondary structure r to secondary structure 
s. 


For RNA shapes r and s, let m r<s be the number of point mutations that 
change a sequence in N(r) to a sequence in N(s) and thus result in a mutation 
from r to s. Note that m r>s = m s , r because each point mutation changing a 
sequence in N(r) to one in N(s) has a corresponding inverse point mutation 
that changes a sequence in N(s ) to one in N(r). Let be the number of 
point mutations that change a sequence in N(r) to a sequence in any other 
neutral network. We can think of * as the number of point mutations that 
take us out of N(r). 

DEFINITION 1.21. The mutation probability, p rs , is defined by 

m r , s 

Pr.s = -• 

m r ,* 

Even though m r<s — m s>r , it need not be the case that p r , s = p s<r since 
the values of m r ^ and m s might be different. For example, if is greater 
than m s then there are more point mutations out of N(r) than there are out of 
N(s), and therefore the proportion of mutations is smaller than We 
will see this asymmetry reflected in the phenotype-space topologies—while 
RNA shape s might be close to RNA shape r, r need not be close to s. 

Because of the asymmetry in these probabilities, a distance function cannot 
provide a means of determining the proximity of two RNA shapes. A distance 
function (or metric, as discussed in Chapter 5) is necessarily symmetric; the 
distance between r and s must equal the distance between s and r. Since a 
distance function will not serve for this purpose, a topological space is a natural 
alternative to consider. 


EXAMPLE 1.21 . In order to clarify how the asymmetry arises in the mutation 
probabilities, in this example we consider probabilities that are defined like the 
mutation probabilities, but in a very different setting. Consider the following 
scenario. In Figure 1.28 we show the six New England states and a table 
of transition probabilities (defined subsequently) between them. For a pair 
of states R and S, we define Brs to be the length of the border between 
the two states. Here we have Bs,r = Br s for each pair of states R and S. 
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FIGURE 1.28: Transition probabilities between the New England states. 

For an individual state R, we define Br* to be the total length of the border 
between R and all of the other New England states. We set the transition 
probability Pr t s equal to Br,s/Br,* and interpret this as the probability that 
a random step out of state R into another New England state lands in state S. 
The probability Pr,s is not symmetric. Since the only state bordering Maine 
is New Hampshire, Pme,nh = E but Pnh.me i 1 1 since New Hampshire 
also borders Massachusetts and Vermont. This example is explored further in 
Exercise 1.44. 


For an explicit example, we consider a topology on GC\o, the set of RNA 
shapes of genotypes of length 10. The forthcoming process used in defining 
a topology from mutation probabilities carries over to RNA shapes associated 
with genotype sequences of any fixed length. 

We do not discuss the details of how the numbers of mutations m r>s and 
are determined for the RNA shapes in GC\q. For our starting point, we 
use the associated mutation probabilities in Figure 1.29, which are derived 
from numbers of mutations as previously described. 
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FIGURE 1.29’ The mutation probabilities for GCio- 
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In Figure 1.29 the entry in the ith row and jth column is the probability 
that a point mutation out of the neutral network of s, results in a sequence in 
the neutral network of sj. 

The notion of proximity that we define on the set of RNA shapes is based 
on the likelihood of a mutation from one RNA shape to another. Since there 
are eight RNA shapes in GC io, each can potentially mutate to seven others. 
Hence if pij > 1 /7, we think of 5* as having more than the average likelihood 
of mutating to Sj. 

For each i — 1,.... 8, define R,- = { 5 ,} U { 5 / | pij > 1/7}. Thus /?, 
consists of Si along with all of the RNA shapes to which s,- has more than the 
average likelihood of mutating. The collection 1Z\p = {/?j}f =1 is not itself 
a topology, but we extend it to one, defining 7j p to be the minimal topology 
on GC io containing Ti-\p. The topology T\p is generated by a basis formed 
by taking finite intersections of the sets in 1Z\p. (See Exercise SE 1.20.) The 
resulting topological space is referred to as a phenotype space. 

A topology on a finite set has a unique minimal basis that generates the 
topology. (See Exercises SE 1.23 and SE 1.24.) If for each 5,- we take all of 
the sets Rj that contain s, , and let be their intersection, then the collection 
B\p = is the minimal basis for 71/7. (See Exercise SE 1.24.) 

From the probability table for GC\q , it is easy to determine this minimal 
basis B\p. To begin, as shown in Figure 1.30, we put a check mark in each 
diagonal table entry and in each table entry where pij > 1 /l. 



s l 

s 2 

s 3 

54 

s 5 

s 6 

s i 

S s 

*1 

/ 


y 





y 

s 2 


y 

y 



y 


y 

s 3 


y 

y 





mm 

54 

/ 



/ 




y 

5 5 


y 



/ 



y 

*6 


y 




y 


y 

s 7 







y 

y 

s S 

/ 

y 

y 


/ 



y 


FIGURE 1.30: The GCio probability-table entries with p, j > 1/7. 

For each i, the checkmarks in row i correspond to the elements in /?,. For 
example, R\ — {si, 53 , 58 } and R 2 = {^ 2 . 53 , s&, 5 s}. To determine the basis 
element 2 ?,- we take the intersection of the rows that contain a check mark in 
the ith column. For example, the column is checked in the second and sixth 
rows. Intersecting R 2 and R(, results in = {s 2 , $ 6 > ^s}- The basis Bip is 
illustrated in Figure 1.31. 

Notice that s% is close to each sj in the sense that the basis element B } 
contains s% and therefore so does every neighborhood of sj. This reflects the 
fact that each sj mutates to 58 with a relatively high probability. On the other 
hand, B$ contains only 58, and therefore no other sj is as close to 58 as 58 is 
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FIGURE 1.31: The basis B\p for the topology T\p. 


to it. While it is a topological consequence that B% = {ss}, in this particular 
example having B$ = {.sg J nicely reflects the fact that the neutral network of 
$8 is large in comparison to the neutral networks of the other Sj ’s and therefore 
mutation out of the neutral network of 58 is less frequent than mutation into it. 

We used the average probability 1/7 as a threshold in constructing Tj/ 7 . 
That is, we defined sets Rj = {s,-} U {sj | pij > 1/7}, and we set T\p 
equal to the minimal topology on GC io containing all of the sets We 
could work with other probability thresholds and possibly obtain different 
topologies on the same set of RNA shapes. For example, if we choose 1/10 as 
our threshold and let TZ\/\o, Tj/io, and B\/\q be defined in the same manner 
as the corresponding collections above, then the basis B\/\q appears as in 
Figure 1.32. (See Exercise 1.42.) 

Researchers hope that establishing a notion of proximity using topology 
will help to clarify the biological processes underlying the evolution of living 
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FIGURE 1.32: The basis B\/\q for the topology Tj/io- 
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organisms. The simple model presented here, using RNA shapes, can carry 
over to more general and complex genotype-phenotype systems. Proximity 
is an important component of continuity, in both real-world and topological 
terms. (See Chapter 4). Consequently, phenotype spaces provide an appro¬ 
priate setting for modeling and investigating continuous and discontinuous 
evolutionary change. 

Exercises for Section 1.4 

1.38. Show that the collection Bp of subsets of Z x Z, used in defining the digital 
plane, is a basis for a topology on Z x Z. 

1.39. For each point (m, n) in the digital plane, determine the smallest closed set 
containing (m , n). (There are four cases to consider: m and n both odd, m odd 
and n even, m even and n odd, and m and n both even.) 

1.40. Suppose our genotypes are four-element sequences made up of the letters N 
and S, and we have three phenotypes with neutral networks: 

Ni = {(N,N,N,N),(N,N,N,S),(N,N,S,N),(N,N,S,S)} 

N 2 = [(N, S, N, N), (JV, S, N, S), (N, S, S, N), (N, S, S, S), 

(S, N, N, AO, ( S, N, N, S)} 

N 3 = [(S, N, S , AO, (S, N, S, S), (S, S, N, N), (5, S, N, S), 

(S, S, 5, AO, (S, S, S, S)} 

Determine the mutation probabilities p\ <2 , p 1 , 3 , P 2 , 3 . P2,\, P 3 ,\, and p 3t2 . 

1.41. Suppose our genotypes are two-element sequences made up of the letters R, D, 
and F, and we have three phenotypes with neutral networks: 

Ni = {(R,R)} 

N 2 = {(*, D). (F, R), (F, F)} 

Ns = {(£>, R), (F, D), (Z), D), (F, F), (D, F)} 

Determine the mutation probabilities p 1 , 2 , p 1,3 ,P 2 . 3 . Pi.i, P 3 ,i. and p 3t2 . 

1.42. (a) Show the steps in determining B 1/7 on GC 10 from the probability table in 

Figure 1.29. 

(b) Determine B\/\q on GC 10 from the probability table in Figure 1.29. 

1.43. What is the minimum value of p such that T p on GC 10 is the discrete topology? 
Justify your result. 

1.44. Consider the six New England states and the associated probability table in 
Figure 1.28. Let NE = {CT, MA, ME, NH, RI, VT), 

(a) Determine the minimal basis for the topology 7o on NE. In this topology 
we are considering pairs of states, R and S , where a random step out of 
state R has a nonzero probability of landing in state S (that is, a step out 
of R can land in S). 

(b) Determine the minimal basis for the topology Tj /3 on NE. 



Chapter 2 

Interior, Closure, 
and Boundary 


In this chapter, we consider some important sets associated with the subsets 
of a topological space. In particular, we introduce the interior and closure of 
a set in Section 2.1, the limit points of a set in Section 2.2, and the boundary 
of a set in Section 2.3. In Section 2.4, we close this chapter with a topological 
modeling application in geographic information systems. 

2.1 Interior and Closure of Sets 

An arbitrary subset A of a topological space might be neither open nor closed. 
However, it is often useful to associate a related open set or a related closed set 
to A. In particular, we can sandwich each set A between the largest open set 
contained in A and the smallest closed set containing A. These sets are known 
as the interior of A and the closure of A, respectively. 

DEFINITION 2.1 . Let Abe a subset of a topological space X. The interior of 

o 

A, denoted A or_Int(A), is the union of all open sets contained in A. The closure 
of A, denoted A or C/(A), is the intersection of all closed sets containing A. 

Clearly, the interior of A is open and a subset of A, and the closure of A is 
closed and contains A. Thus we have the aforementioned set sandwich, with 
A caught between an open set and a closed set: A c A c A. 

The following properties follow readily from the definition of interior and 
closure. 

THEOREM 2.2. Let Xbea topological space and A and B be subsets ofX. 

(i) If U is an open set in X and U C A, then U C Int(A). 

(ii) IfC is a closed set in X and A C C, then C/(A) C C. 

(iii) If A c B then Int(A) C Int(B). 

(iv) If Ac B then Cl(A ) c Cl{B). 

(v) A is open if and only if A = Int(A). 

(vi) A is closed if and only if A = Cl{A). 

We prove (i), (iii), and (v) here. For (ii), (iv), and (vi) see Exercise 2.2. 

Proof of (i). Suppose that U is an open set in X and U C A. Since 
Int(A) is the union of all of the open sets that are contained in A, it 
follows that U is one of the sets making up this union and therefore is a 
subset of the union. That is, U C Int(A). ■ 
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Proof of (iii). Since A c B, Int(A) is an open set contained in B. Part 
(i) implies that every open set contained in B is contained in Int(B). 
Therefore Int(A) C Int(B). B 

Proof of (v). If A = Int(A), then A is an open set, since by definition 
Int(A) is an open set. 

Now assume that A is open. We show that A = Int(A). First, 
Int(A) C A by definition of Int(A). Furthermore, since A is an open set 
contained in A, it follows by (i) that A c Int(A). Thus A = Int(A) as 
we wished to show. B 

Theorem 2.2(i) indicates that every open set contained in A is contained 
in the interior of A. In this way Int(A) is the largest open set contained in 
A. Similarly, Theorem 2.2(ii) indicates that every closed set containing A also 
contains the closure of A, and thus C1(A) is the smallest closed set containing 
A. We will find these perspectives on interior and closure very useful when 
establishing properties of these sets. 


EXAMPLE 2.1. Consider A = [0,1) as a subset of R with the standard 
topology. Then A = (0, 1) and A = [0,1]. 


EXAMPLE 2.2 ; Consider A = [0,1) as a subset of R with the discrete 
topology. Then A = A = [0, 1). 


EXAMPLE 2.3. Consider A = [0,1) as a subset of R in the finite complement 
topology. Here A = 0 because there are no nonempty open sets contained in 
[0, 1). Since A is infinite, and the only infinite closed set in this topology is R, 
it follows that A = R. 


EXAMPLE 2.4. Consider A = [0,1) as a subset of R in the lower limit 
topology. Here, A = A since A is an open set. Note that R — [0, 1) = 
(—oo, 0) U [ 1, oo) is an open set in R with the lower limit topology. Therefore 
[0, 1) is closed in the lower limit topology, implying that A = A as well. 


IMPORTANT NOTE: As Examples 2.1-2.4 demonstrate, the interior and 
closure of a set A depend on the topology on the set X containing A, not just 
on the set A. 


EXAMPLE 2.5. Consider the set of rational numbers Q in R with the standard 
topology. We claim that Q = 0. Assume it is not, and suppose that U is a 
nonempty open set contained in <Q>. Let x be an element in U. Then there is 
an open interval (a, b ) such that x e (a, b) C U C Q. But between every 
pair of real numbers there is an irrational number. Thus every interval contains 
elements of R — Q, and therefore so does U. This is a contradiction; hence 
Q = 0. 
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While the interior of <Q> contains nothing, if we take the closure we get 
everything; specifically, Q = E (see Exercise 2.6). 


DEFINITION 2.3. A subset A of a topological space X is called dense if 
Cl(A ) = X. 

By Example 2.5, it follows that Q is dense in E with the standard topology. 


EXAMPLE 2.6. In the finite complement topology on E, every infinite set is 
dense. Why? In this topology, the closed sets are either finite sets or E itself. 
Therefore E is the only closed set containing an infinite set. Thus, if A is an 
infinite subset of E, then C1(A) = E, implying that A is dense in E. 


The following two theorems provide simple means for determining when 
a particular point y is in the interior or in the closure of a given set A: 

THEOREM 2.4. Let X be a topological space, Abe a subset of X, and y be 
an element ofX. Then y e Int(A) if and only if there exists an open set U such 
that y e U C A. 

Proof. First, suppose that there exists an open set U such that y € U C 
A. Then, since U is open and contained in A, it follows that U C Int(A). 
Thus y € U implies that y 6 Int(A). 

Next, if y € Int(A), and we set U = Int(A), it follows that U is an 
open set such that y € U C A. ■ 

THEOREM 2.5. Let X be a topological space, A be a subset of X, and y be 
an element of X. Then y € CZ(A) if and only if every open set containing y 
intersects A. 

Proof. See Exercise 2.10. ■ 


-1 

EXAMPLE 2.7. In E 2 with the standard topology, let A be the lollipop that 
appears on the left in Figure 2.1. Then Int(A) appears as in the middle of the 
figure, and C1(A) appears as on the right. 



FIGURE 2.1: The interior and closure of a set A in K 2 . 

_i 
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The following theorem provides some useful relationships involving inte¬ 
rior, closure, intersection, union, and complement: 


THEOREM 2.6. For sets A and B in a topological space X, the following 
statements hold: 

(i) Int(X — A) = X — Cl(A). 

(ii) Cl(X — A) = X — Int(A). 

(iii) Int(A) U Int(B) c Int(A U B), and in general equality does not 
hold. 

(iv) Int{A) fl Int(B) = Int(A Pi B). 


We prove (i) and (iii) here. For proofs of (ii) and (iv) see Exercise 2.11. In 
addition, Exercise 2.12 addresses two results involving closure that correspond 
to (iii) and (iv). 


Proof of (i). We prove that X - C1(A) c Int(X - A) and Int(X — A) c 
X — C1(A). First, note that Cl(A) is closed and contains A, and therefore 
X — Cl( A) is an open set contained in X — A . It follows by Theorem 2.2(i) 
that X - C1(A) c Int(X - A). 

To prove that Int(X — A) C X — Cl (A), let x e Int(X — A) be 
arbitrary. Note that Int(X — A) is disjoint from A, and therefore x is 
in an open set that is disjoint from A. By Theorem 2.5, it follows that 
x £ C1(A); hence, x eX- C1(A). Thus, Int(X - A) c X - C1(A). 

Since we have shown that both X — C1(A) C Int(X — A) and 
Int(X - A) c X - C1(A) hold, we now have Int(X - A) = X - C1(A), 
as we wished to show. ■ 


Proof of (iii). Since Int(A) c A, it follows that Int(A) c A U B. Also, 
Int(A) is an open set. Similarly, Int(fi) is an open set contained in A U B. 
Every open set contained in A U B is contained in Int(A U B). Therefore 
both Int(A) and Int(fi) are contained in Int(A U B). Hence their union, 
Int(A) U Int(B), is also contained in Int(A U B). 

Now we must show that there are cases where Int(A) U Int(B) does 
not equal Int(A U B). Take A = [—1,0] and B = [0, 1] as subsets of R 
with the standard topology. Then Int(A) U Int(B) = (— 1,0) U (0, 1), but 
Int(A U B) = (—1, 1). Thus, in this case Int(A) U Int(B) ^ Int(A U B). 
(See Figure 2.2.) ■ 
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FIGURE 2.2: Int(A) U Int(B) ^ lnt(A U B). 


EXAMPLE 2.8. Let A be the set of irrational numbers as a subset of R with the 
standard topology. We show that A is dense. Note that A = R — Q where Q is 
the set of rational numbers. Therefore C1(A) = C1(R — Q) = R — Int(Q), with 
the second equality holding by Theorem 2.6(ii). In Example 2.5 we showed 
that Int(Q) = 0. Thus C1(A) = R, implying that A is dense in R. 


Exercises for Section 2.1 

2.1. Determine Int(A) and C1(A) in each case. 

(a) A = (0,1] in the lower limit topology on R. 

(b) A = {a} in X = { a , b, c} with topology [X, 0 , {a}, [a, b}}. 

(c) A = {a, c] in X = {a, b, c] with topology {X, 0, {a}, {a, b}}. 

(d) A = {b} in X = {a, b, c] with topology [X, 0 , (a), {a, b}}. 

(e) A = (—1,1) U {2} in the standard topology on R. 

(f) A = (— 1,1) U {2} in the lower limit topology on K. 

(g) A = {(x, 0) e R 2 | x e R} in R 2 with the standard topology. 

(h) A = {(0, x) € R 2 | x e R} in R 2 with the topology generated by the basis 
in Exercise 1.19. 

(i) A = {(x, 0) € R 2 | x e R} in R 2 with the topology generated by the basis 
in Exercise 1.19. 

2.2. Prove Theorem 2.2, parts (ii), (iv), and (vi): Let X be a topological space 
and A and B be subsets of X. 

(a) If C is a closed set in X and A c C, then C1(A) c C. 

(b) If Ac B then C1(A) c Cl(£). 

(c) A is closed if and only if A = Cl(A). 

2.3. For m < n € Z, let / m ,„ = {m, m + 1,...,«}. Determine Int(/ m , n ) and 
Clin the digital line topology. (There are four cases to consider, allowing 
m to be either even or odd, as well as n.) 

2.4. Consider the particular point topology PPX p on a set X. (See Exercise 1.7.) 
Determine Int(A) and C1(A) for sets A containing p and for sets A not contain¬ 
ing p. 

2.5. Consider the excluded point topology EPX p on a set X. (See Exercise 1.8.) 
Determine Int(A) and C1(A) for sets A containing p and for sets A not contain¬ 
ing P- 
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2.6. Prove that C1(Q) = R in the standard topology on R. 

2.7. Let B = {jreR|aeZ, n € Z + ). Show that £ is dense in R. 

2.8. (a) Show that the set of odd integers is dense in the digital line topology on 

Z. Is the same true for the set of even integers? 

(b) Which subsets of Z are dense in the discrete topology on Z? 

2.9. InR 2 with the standard topology, prove that Cl ((a, b) x (c, d)) = [a, b] x [c, d\ 
and Int([a, b ] x [c, d]) — (a, b ) x (c, d). 

2.10. Prove Theorem 2.5: Let X be a topological space, A be a subset of X, and y 
be an element of X. Then y e C1(A) if and only if every open set containing y 
intersects A. 

2.11. Prove Theorem 2.6, parts (ii) and (iv): For sets A and £ in a topological 
space X, the following hold: 

(a) C1(X - A) = X - Int(A). 

(b) Int(A) n Int(£) = Int(A D £). 

2.12. In each case, determine whether the relation in the blank is c, D. or =. In 
cases where equality does not hold, provide an example indicating so. 

(a) Cl(A)nCl(fl)_Cl(ADfl) 

(b) C1(A) U Cl(£)_C1(A U £) 


2.2 Limit Points 

In the standard topology on 1", a limit point of a subset appears as what we 
might expect; that is, there is a sequence of points in the subset approaching 
the limit point. This is a very useful concept. However, we make the definition 
of limit point precise via neighborhoods rather than sequences. In topological 
spaces that are not quite as well-behaved as R", we will see that this allows for 
some unusual, and perhaps counterintuitive, possibilities. 

DEFINITION 2.7. Let Abe a subset of a topological space X. A point x in 
X is a limit point of A if every neighborhood of x intersects A in a point other 
than x. 

Notice that a limit point x of a set A may or may not lie in the set A. Notice 
also that in every topology, the point x is not a limit point of the set {.t}. 


EXAMPLE 2.9. Consider the set A = [~ e R | n e Z+} as a subset of R 
with the standard topology. It is illustrated in Figure 2.3. 

- ( — «....—. ----- 


FIGURE 2.3: The set A = e R | n e Z+}. 

The point 0 is a limit point of A. Why? If U is an open set containing 0, 
then there is an interval (a, b ) such that 0 e (a, b) C U. But (a, 6) fl A ^ 0 
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for every such open interval. So U n A ^ 0. Therefore every neighborhood 
of 0 intersects A, and since 0 is not in A, the intersection contains points other 
than 0. It follows that 0 is a limit point of A. 

In fact, 0 is the only limit point of A. Given x e R — {0}, we can find an 
open interval containing x that either is disjoint from A (if x g A) or intersects 
A only in x (if x e A). In either case, if x 0, then x is not a limit point of A. 


EXAMPLE2.10. Consider(0, 1] as a subset ofR with the standard topology. 
What are the limit points of (0, 1]? Here, every x e [0, 1] is a limit point 
of (0,1]. Every neighborhood U of x e [0, 1] contains an interval (a.b) 
containing x. Such an interval intersects (0, 1] in a point other than x, and 
therefore x is a limit point of (0, 1]. 

Furthermore, if x g [0, 1], then there are open intervals containing x that 
are disjoint from (0, 1]. Therefore if x g [0, 1], then x is not a limit point of 
( 0 , 1 ]. 

Thus [0, 1] is the set of limit points of (0, 1]. 


EXAMPLE 2.11 . Consider the rational numbers Q as a subset of R with the 
standard topology. Every x e R is a limit point of <Q>. Why? Given a real 
number x, an open set U containing x contains an open interval ( a , b) that also 
contains x. But every open interval intersects Q in infinitely many points, and 
therefore (a, b) intersects Q in a point other than x. Hence, x is a limit point 
of Q. 


Limit points provide us with an easy means to find the closure of a set. 

THEOREM 2.8. Let A be a subset of a topological space X, and let A! be 
the set of limit points of A. Then Cl(A ) — AU A'. 

Proof. We show that AUA'c C1(A) and C1(A) CAUA'. 

First we show that A U A' C C1(A). Certainly, A c C1(A), so all 
we need to show is that A' c C1(A). Suppose x e A’. Then every 
neighborhood of x intersects A. By Theorem 2.5, x e C1(A), and it 
follows that A' C C1(A). Thus AUA'c C1(A). 

Now consider C1(A) C A U A'. Suppose x e C1(A). Either x € A 
or x € C1(A) — A. In the former case, it follows that x e A U A'. 
Consider the latter case, x € C1(A) — A. Since x e C1(A), Theorem 2.5 
implies that every open set containing x intersects A. Since x & A, 
such an intersection must contain a point other than x. Thus x is a limit 
point of A, and it follows that x e A U A'. In either case (x e A or 
x € C1(A) — A) we have x e A U A', implying that C1(A) c A U A', 
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COROLLARY 2.9. A subset A of a topological space is closed if and only if 
it contains all of its limit points. 

Proof. By Theorem 2.2(vi), A is closed if and only if A = C1(A). By 
Theorem 2.8, A = Cl (A) if and only if A = A U A' where A' is the set 
of limit points of A. Finally, A = A U A' holds if and only if A' C A. 
Thus A is closed if and only A' c A, as we wished to show. ■ 


EXAMPLE 2.12. The infinite comb C was introduced in Exercise 1.27 and is 
shown here in Figure 2.4. The set C consists of the horizontal segment running 
from (0,0) to (1,0) on the Jt-axis, along with the vertical segments running 

from (^tt, 0) to (^r, 1), for each n = 0, 1, 2,_ 

If we regard C as a subset of M 2 in the standard topology, every point 
of C is a limit point of C, as are the points in the vertical line segment Y = 
{(0, y) i 0 < y < 1}. As illustrated in Figure 2.4, every neighborhood of each 
point (0, y) € Y intersects vertical segments making up C. Besides the points 
in C U Y, there are no other limit points of C, so C1(C) = C U T. 



FIGURE 2.4: Points (0, y) with 0 < y < 1 are limit points of the infinite comb. 


A concept closely related to limit points is that of a convergent sequence. 

DEFINITION 2.10. In a topological space X, a sequence {x\,X 2 , ...) con¬ 
verges to x € X if for every neighborhood U of x, there is a positive integer 
N such that x n e U for all n > N. We say that x is the limit of the sequence 
Qci, X 2 , ■ ■ ■), and we write 

lim x n = x. 
n->oo 

The idea behind a sequence converging to a point x is that, given any 
neighborhood U of x, the sequence eventually enters and stays in U. 


EXAMPLE 2.13. Consider the sequence given by x n = in R, as illus¬ 
trated in Figure 2.5. In the standard topology on R, the sequence converges 
to 0 since every neighborhood U of 0 contains an open interval (—a, a) into 
which the sequence eventually enters and stays. (See Exercise 2.22.) 
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FIGURE 2.5: The sequence x„ = 

On the other hand, if we consider the lower limit topology on E, this 
sequence does not converge to 0. Take, for example, the neighborhood [0,1) 
of 0. The sequence enters this neighborhood, but never stays in it since every 
other term jumps out, taking on a value less than 0. In fact, in the lower limit 
topology on E, this sequence does not converge at all. 


In the standard topology on E", limit points are limits of sequences, as the 
following theorem indicates: 

THEOREM 2.11. Let Abe a subset o/E” in the standard topology. If x is 
a limit point of A, then there is a sequence of points in A that converges to x. 

Proof. See Exercise 2.20. ■ 


In a general topological space X, a limit point of a set B c X is not 
necessarily the limit of a sequence in B. (See Exercise 2.23.) 

Our intuition might suggest that if a sequence converges to a point, then that 
point should be unique. That is the case in the standard topology on E”. But 
in other topologies this need not be so. For example, in the finite complement 
topology on E, every sequence with an infinite range converges to every point 
in E. (See Exercise 2.19.) 

Such a situation cannot occur in a Hausdorff space. The next theorem 
presents another of the convenient properties of a Hausdorff space. 


THEOREM 2.12. IfX is a Hausdorff space, then every convergent sequence 
of points in X converges to a unique point in X. 

Proof. Let A - be a Hausdorff space. Suppose that a sequence (*i, * 2 ,...) 
converges to two different points, x and y, in X. Since X is Hausdorff, x 
and y have disjoint neighborhoods U and V, respectively. The sequence 
(x\, X 2 , ■ ■ ■ ) converges to x; therefore there exists N e Z+ such that 
x n e U for all n > N. Similarly, there exists Me Z+ such that x n e V 
for all n > M. Thus, if m is greater than both N and M, then x m e U HV, 
contradicting the fact that U and V are disjoint. Therefore convergent 
sequences in a Hausdorff space converge to a unique point. B 

Exercises for Section 2.2 

2.13. Determine the set of limit points of A in each case. 

(a) A = (0,1] in the lower limit topology on M. 

(b) A = {a} in X = [a, b, c] with topology {X, 0, {a}, [a, b}}. 

(c) A = [a, c} in X = {a, b , c } with topology {X, 0, {a}, [a, b}}. 

(d) A = {b} in X = {a, b, c] with topology {X, 0, [a], [a, b}}. 
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(e) A = (— 1, 1) U {2} in the standard topology on R. 

(f) A = (— 1,1) U {2} in the lower limit topology on R. 

(g) A = {(jc, 0) e R 2 | x e R) in R 2 with the standard topology. 

(h) A = {(0, jc) e R 2 | jc € R} in R 2 with the topology generated by the basis 
in Exercise 1.19. 

(i) A = {(x, 0) e R 2 | x € R} in R 2 with the topology generated by the basis 
in Exercise 1.19. 

2.14. For each n e Z + , let B„ = {«, n + 1, n + 2,...}, and consider the collection 
B={B n \neZ+}. 

(a) Show that B is a basis for a topology on Z + . 

(b) Show that the topology on X generated by B is not Hausdorff. 

(c) Show that the sequence (2,4,6, 8,...) converges to every point in Z + 
with the topology generated by B. 

(d) Prove that every sequence with infinite range converges to every point in 
Z + with the topology generated by B. 

2.15. Determine the set of limit points of [0,1] in the finite complement topology 
on R. 

2.16. Determine the set of limit points of the single-point set [n] in the digital line 
topology. (The result depends on whether n is odd or even.) 

2.17. (a) Let B — {[a, b) c R | a, b e Q and a < b }. Show that B is a basis for a 

topology on R. The resulting topology is called the rational lower limit 
. topology and is denoted R r /. 

(b) Determine the closures of A = (0, \/2) and B = (\/2, 3) in R/ and in R r i- 

2.18. Determine the set of limit points of A = + £ e R | m, n 6 Z+} in the 

standard topology on R. 

2.19. Show that if (x n ) is a sequence in M having infinite range, then (x n ) converges 
to every point in R with the finite complement topology on R. 

2.20. Prove Theorem 2.11: Let A be a subset of R" in the standard topology. If x is 
a limit point of A, then there is a sequence of points in A that converges to x. 

2.21. Determine the set of limit points of the set 

5 = {(x, sin(i)) e R 2 | 0 < x < 1) 

as a subset of R 2 in the standard topology. (The closure of S in the plane is 
known as the topologist’s sine curve.) 

2.22. Consider the sequence defined by x n = (r fi 1> '' in R with the standard topology. 

(a) Prove that every neighborhood of the point 0 contains an open interval 
(—a, a). 

(b) Prove that for each open interval (—a, a), there exists N e Z + such that 
x n € (—a, a) for all n > N. 

2.23. Let T be the collection of subsets of R consisting of the empty set and every 
set whose complement is countable. 

(a) Show that T is a topology on R. (It is called the countable complement 
topology.) 

(b) Show that the point 0 is a limit point of the set A = R — {0} in the countable 
complement topology. 

(c) Show that in A = R — {0} there is no sequence converging to 0 in the 
countable complement topology. 
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2.3 The Boundary of a Set 

We have an intuitive idea of what we mean by the boundary of a set: the points 
that lie close to both the inside and the outside of the set. (See Figure 2.6.) But 
as we have seen, there are examples and topologies out there that challenge 
or defy our intuition. For example, what is the boundary of Q as a subset of 
R in the standard topology? Also, what is the boundary of the closed interval 
[0,1] as a subset of R in the finite complement topology? The definition of 
the boundary of a set has to be chosen carefully and understood clearly for its 
proper use as a central concept in topology. 


The 

boundary 
of A 



FIGURE 2.6: The boundary of a set. 

DEFINITION 2.13. Let A be a subset of a topological space X. Theboundary 
of A, denoted 3 A, is the set 3A = Cl(A ) — Int(A). 


EXAMPLE 2.14. Let A = [—1, 1] in the standard topology on R. As il¬ 
lustrated in Figure 2.7, we have C1(A) = [—1,1] and Int(A) = (—1, 1), and 
therefore 3A = {—1,1}. 


A = C1(A) 


-1 

Int(A) 


-1 


i 

—> 

i 

3(A) 


-1 


1 


FIGURE 2.7: Determining the boundary of [-1,1] in R. 

_i 

-1 

EXAMPLE 2.15. Let A be the subset of the plane appearing on the far left 
in Figure 2.8. Then, as pictured, C1(A) is a closed ball, Int(A) is an open ball, 
and 3 A is a circle. 


The next theorem gives conditions that help us to determine whether or not 
a point x lies in the boundary of a set A. 
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FIGURE 2.8: Determining the boundary of A in the plane. 

THEOREM 2.14. Let Abe a subset of a topological space X and let x be a 
point in X. Then x € 3 A if and only if every neighborhood ofx intersects both 
A and X — A. 

Proof. To begin, suppose that x e 3 A. Then x e C\(A) and x Int(A). 
Since x € C1(A), it follows that every neighborhood of x intersects 
A. Furthermore, since x & Int(A), it follows that every neighborhood 
of x is not a subset of A and therefore intersects X — A. Thus, every 
neighborhood of x intersects A and X — A. 

Now suppose that every neighborhood of x intersects A and X — A. It 
follows that x € C1(A) andx € C1(X — A). By Theorem 2.6 implies that 
C1(X — A) — X — Int(A), and therefore x g Int(A). Thus, x € C1(A) 
and x ^ Int(A); that is, x € C1(A) — Int(A) = 3A. ■ 

Here are some quick facts about the boundary of a set A. They all follow 
readily from the definition. 

THEOREM 2.15. Let A be a subset of a topological space X. Then the 
following statements about the boundary of A hold: 

(i) 3 A is closed. 

(ii) 3 A = C7(A) n Cl(X - A), 
fill) 3A n Int(A) = 0. 

(iv) 3A U Int(A) = Cl(A). 

(v) 3 A C A if and only if A is closed. 

(vi) 3 AH A — 0 if and only if A is open. 

(vii) 3 A = 0 if and only if A is both open and closed. 

Proof. See Exercise 2.28. ■ 


EXAMPLE 2.16. Consider Q in the standard topology on R. Since C1(Q) = 
R, and Int(Q) = 0, it follows that 3Q = R. 

Therefore the whole real line R is the boundary of the rational numbers. 
That makes sense—every real number is arbitrarily close to the set of rational 
numbers and to its complement, the set of irrational numbers. 
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EXAMPLE 2.17. Consider the vertical interval .A = {0} x [—1, l]asasubset 
of R 2 with the standard topology. Viewing the interval this way, as a subset of 
R 2 rather than as a subset of R, makes a huge difference in how its boundary 
appears. Here, Int(A) = 0 and C1(A) = A; therefore dA = A. 


EXAMPLE 2.18. Let A = [—1, 1] in R with the discrete topology. Here 
Int(A) = [—1, 1] and C1(A) = [—1, 1], so dA = 0. 

Alternatively, we could observe that [— 1, 1] is both open and closed in the 
discrete topology, and therefore its boundary is empty by Theorem 2.15(vii). 


EXAMPLE 2.19. Let A = [—1, 1] as a subset of R;, the real line with the 
lower limit topology. In this case, C1(A) = [—1,1] and Int(A) = [—1,0), so 
3A = {-!}. 


The previous examples show that, as with interior and closure, the boundary 
of a set A depends on the topology on the set X containing A, not just on A 
itself. 

Exercises for Section 2.3 

2.24. Determine 9 A in each case. 

(a) A = (0,1] in the lower limit topology on R. 

(b) A = {a} in X = {a, b, c] with topology {X, 0, {a}, {a, b}}. 

(c) A = {a, c} in X = [a, b, c} with topology {X, 0, {a}, {a, b}}. 

(d) A = {£>} in X — {a, b , c] with topology [X, 0, {a}, {a, b}}. 

(e) A = (—1, 1) U {2} in the standard topology on R. 

(f) A = (— 1, 1) U {2} in the lower limit topology on R. 

(g) A = {(x, 0) € R 2 | x € R} in R 2 with die standard topology. 

(h) A = {(0, x) € R 2 | x € R} in R 2 with the topology generated by the basis 
in Exercise 1.19. 

(i) A = {(x, 0) € R 2 | x € R} in R 2 with the topology generated by the basis 
in Exercise 1.19. 

2.25. (a) For m < n e Z, let 7 m ,„ = [m, m + 1,... , n}. Determine 9(/ m ,„) in the 

digital line topology. (There are four cases to consider, allowing m to be 
either even or odd, as well as n .) 

(b) For n 6 Z, determine 9({n}) in the digital line topology, considering 
separately the cases where n is even and n is odd. Discuss how your 
results for 9({n}) reflect the digital image display structure modeled by 
the digital line as described in Section 1.4. 

2.26. Determine the boundary of each of the following subsets of R 2 in the standard 

topology: 

(a) A = {(x, 0) e R 2 | x e R}. 

(b) 5 = {(x,y)eM 2 |x>0,y#0). 

(c) C = {(±,0)eR 2 |neZ+}. 

(d) D = {(x, y) e R 2 | 0 < x 2 - y 2 < 1}. 
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2.27. Determine 3([0, 1]) in R with the finite complement topology. Justify your 
result. 

2.28. Prove Theorem 2.15: Let A be a subset of a topological space X. 

(a) 3 A is closed. 

(b) 3A=Cl(A)nCl(X-A). 

(c) 3A n Int(A) = 0. 

(d) 3AUInt(A) =C1(A). 

(e) 3 A C A if and only if A is closed. 

(f) 3A fl A = 0 if and only if A is open. 

(g) 3 A = 0 if and only if A is both open and closed. 


2.4 An Application to Geographic Information Systems 

A geographic information system (GIS) is a computer system capable of assem¬ 
bling, storing, manipulating, and displaying geographically referenced data. 
The data are often used for solving complex planning and management prob¬ 
lems. To analyze spatial information, users select data from a GIS by sub¬ 
mitting queries. Typical GIS queries incorporate spatial relations to describe 
constraints about spatial objects to be analyzed or displayed. 

For example, in studying wetland protection within a state’s recreation 
areas, a GIS user might ask for a display of all wetlands that lie partially or 
entirely within the state’s parklands. (See Figure 2.9.) The GIS would search 
for wetland areas and state parkland areas and examine the relationship of 
each to the other in order to return all wetlands that satisfied the specified 
requirements. 



FIGURE 2.9: Which wetlands lie partly or entirely within a state park? 

Evidently, in a GIS there is a need to be able to distinguish different ways 
that land regions can lay in relation to each other. For a mathematician, it often 
suffices to know if two sets intersect or not, but in a GIS a classification finer 
than intersect/not-intersect is needed. For example, in Figure 2.10, while sets 
A and B and sets A' and B' intersect, there is an obvious difference in the 
nature of their intersections. There is a need to make these distinctions precise. 
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FIGURE 2.10: A and B intersect in a different manner than do A' and B’. 

Furthermore, in Figure 2.11, while the initial view of sets C and D sug¬ 
gests that they intersect, after zooming in we see that they do not. Information 
stored in a GIS regarding the relationship between such sets should not depend 
on how the sets appear in a particular picture. Thus, by storing the information 
that C and D do not intersect, the GIS ensures that the nature of the relation¬ 
ship between the two sets is independent of potentially misleading pictorial 
representations of them, particularly those characterized by poor resolution. 



FIGURE 2.11: Sets C and D appear to intersect, but zooming shows they do not. 

Let us take another view of the GIS modeling challenge. Computer systems 
are very good at modeling and manipulating one-dimensional information. 
One-dimensional information is essentially numerical in nature, and computer 
systems possess a straightforward, workable model of the real number system, 
its arithmetic operations, and its order relationship. Computers can easily 
handle queries about one-dimensional information, such as “If I write out a 
check for $ 1000 will I overdraw my account?” and “What is the total population 
of the six New England states?” 

But when the nature of the information moves up a dimension, the modeling 
challenge becomes considerably more difficult. A model must allow answers 
to queries such as “In which states does Yellowstone National Park lie?” and 
“Is Illinois north of Florida?” A system capable of addressing such queries 
must keep track of the geographic objects, be able to perform operations on 
them, and be able to precisely define relationships such as “lies in” and “is north 
of.” Furthermore, such a system must match our intuition and understanding 
of such objects, operations, and relationships. Clearly, this is a significant 
modeling challenge. 
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Geographic information systems theory is one area in the field of Spatial 
Information Science and Engineering, a field that has undergone tremendous 
growth in recent years. It is a field where topology has provided valuable 
modeling tools. We present a simple model, originally published in [Ege], that 
employs topological concepts to define and distinguish relationships between 
pairs of geographic areas. This model is by no means complete, but it is a start. 
It has been adopted as a GIS-industry standard for describing the relationships 
it addresses. 

In the model we present here, our interest is in pairs of closed sets A and B 
in a topological space X, and our goal is to use topological concepts to examine 
the different ways in which A and B can lie in relation to each other in X. 

Given closed sets A and B in X, we consider the four intersections 3 A fl dB, 
A fl B, dA n B, and A n dB, examining whether or not they are empty. 


DEFINITION 2.16. For a set Y, define Cy — 0 if Y is empty and Cy = 1 if 
Y is not empty. Given closed sets A and B in X, define Ia,b» the intersection 
value for A and B in X, by 


Ia.b — (CdAndB, C^nB’ ^9/inB’ ^AC\dB )• 


Examples of intersection values for pairs of sets in the plane are shown in 
Figure 2.12. 



FIGURE 2.12: Intersection values for various pairs of sets in the plane. 

Since each of the four entries in the intersection value can be either 0 or 
1, there are 16 different possibilities for the intersection value. All 16 can be 
realized by pairs of closed sets in the plane. In Figure 2.13 we show eight of 
them. We ask you to find examples of the other eight in Exercise 2.34. 

In an application such as a GIS, we may want to restrict the types of sets 
under consideration. For example, we may wish to model only geographic 
areas such as the ones shown in Figure 2.9. Thus we would exclude sets such 
as those shown in Figure 2.13 that have a segment sticking out. Although we 
might want to allow infinitely thin arcs when we examine borders between 
countries, we do not want to allow them to represent land areas. How can we 
eliminate this whole class of possibilities? The next definition helps us do so. 
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FIGURE 2.13: Pairs of closed sets in the plane realizing eight of the possible inter¬ 
section values. 


DEFINITION 2.17. A set A for which A — Cl(Int(A )) is called regularly 
closed. 

For example, closed intervals [a, b ] with a < b are regularly closed in M 
with the standard topology, since the interior is ( a , b) and the closure of the 
interior is the original set [a, b]. The sets appearing on the left in Figure 2.14 
are regularly closed subsets of the plane in the standard topology, while those 
appearing on the right are not. 



FIGURE 2.14: Closed sets in the plane, some regularly closed, some not. 

Regularly closed subsets of the plane have no whiskers attached; all bound¬ 
ary points have interior points arbitrarily close by. (See Exercise 2.35.) Such a 
feature is a natural requirement in modeling geographic areas. Once we restrict 
ourselves to regularly closed sets, only some of the 16 intersection values will 
be possible. We make this explicit via the following theorem and corollary: 
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THEOREM 2.18. Let A and B be regularly closed in a topological space X. 
IfdA n 2? + 0, then ADB + 0. 

Proof. First, we prove that 3 Ac Cl (A). Since A is closed, it follows 
that 3 A c A. Furthermore, A = C1(A) by the definition of regularly 
closed. Thus, 3 A C C1(A). 

Now, to prove that if 3 A n B f 0, then ADB 0, assume 
3 A fl S ^ 0, and let x be an element of this intersection. Since x e 
3 A C Cl (A), it follows that every open set containing x intersects A. 
But, B is an open set containing x; therefore B intersects A. Hence, 
A fl B fz 0 , as we wished to show. ■ 

The following corollary is now evident: 

COROLLARY 2.19. Intersection values (l, 0, 1,0), (0, 0, 1,0), (1,0, 0, 1), 
( 0 , 0 , 0 , 1 ), ( 1 , 0 , 1 , l),and(0, 0 , 1 , 1 ) are not possible for a pair of regularly 
closed sets A and B in a topological space X. 

Each of the 10 intersection values not excluded by Corollary 2.19 can be 
realized by pairs of regularly closed sets in the plane. Some are shown in 
Figure 2.13; for the rest, see Exercise 2.34. 

The intersection value provides us with a simple means for distinguishing 
the different ways two sets can intersect, but it does not distinguish many 
different configurations. There are, however, situations where the intersection 
value allows us to draw specific conclusions about the nature of the relationship 
between two sets. For example, it is easy to see that if Ia.b = (0,0,0,0) then 
A and B are disjoint. With some straightforward assumptions about A and 
B we obtain a collection of sets for which the intersection value allows other 
specific conclusions to be drawn about the nature of the configuration of the 
two sets. 

DEFINITION 2.20. A planar spatial region is a nonempty proper subset C 
ofR 2 satisfying the following conditions: 

(i) It is regularly closed. 

(ii) The interior of C cannot be expressed as the union of two disjoint 
nonempty open sets. 

We have already addressed the significance of being regularly closed. The 
second condition in the definition of a planar spatial region is related to connect¬ 
edness, a topological property that is the subject of Chapter 6. This condition 
requires that the set not have separate interior parts that are, for instance, only 
accessible from each other by going through the boundary. (See Figure 2.15.) 

Closed balls and polygons are examples of planar spatial regions. Results 
about connectedness are needed to prove that such sets satisfy the second con¬ 
dition for being a planar spatial region. We develop those results in Chapter 6. 

In Figure 2.15 we show examples of subsets of the plane that are not planar 
spatial regions. The set A is not a planar spatial region because it is not regularly 
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FIGURE 2.15: Sets that are not planar spatial regions. 

closed. The set B is not a planar spatial region because its interior is the union 
of two disjoint nonempty open sets. 

The following theorem indicates that, if we restrict ourselves to planar 
spatial regions, the intersection value enables us to make specific conclusions 
about the relationship between the sets involved: 

THEOREM 2.21 . Let A and B be planar spatial regions. Only the intersec¬ 
tion values listed in the left column of the following table are possible for A and 
B. Furthermore, if the intersection value is as depicted in the left column, then 
the relationship between A and B is as depicted in the corresponding entry in 
the right column. 


Intersection Value 

Relationship 

(1,1, 1,0) 

AC B 

(0,1,1,0) 

A C Int(B) 

(1,1,0, 1) 

B c A 

(0,1,0, 1) 

B c Int(A) 

(1,1,0,0) 

A = B 

(0,0,0,0) 

A n b = 0 

(1,0,0,0) 

AOB = dAndB 

(0, 1 , 1 , 1) 

Int(A) n Int(B) ^ 0, 

A <£ B, and B <£ A 

(1,1,1,1) 


The results in the last four rows of the table actually hold for every pair of 
closed sets A and B in a topological space X. (See Exercise 2.30.) The proofs of 
the remaining conclusions in this theorem require aspects of connectedness that 
are established in Chapter 6. We revisit these results in a set of supplementary 
exercises in Section 6.1. 

The result in the fifth row is particularly interesting. Given two planar 
spatial regions, if we know that their intersection value is (1,1,0,0), then we 
can conclude that the sets must be equal. 

Both of the properties required for a set to be a planar spatial region are 
needed in Theorem 2.21. If we do not require both properties, then it is possible 
to find unequal sets A and B for which (1,1,0,0) is the intersection value. (See 
Exercise 2.31.) 
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The Open Geospatial Consortium is a group of software developers who 
have jointly established industry standards for GIS software. They have adopted 
a topologically based function, expanding on the intersection value, for coding 
how two geographic areas lie in relation to each other. (See [Ryd].) We will not 
be specific about what we medn by “geographic areas,” but they are examples 
of planar spatial regions. 

Since geographic areas are planar spatial regions, Theorem 2.21 applies. 
Consequently the Open Geospatial Consortium has adopted a language for 
describing the relationships between pairs of geographic areas, associating 
descriptive expressions to the intersection values as shown in the following 
table. These descriptive expressions cover all possibilities for the relationship 
between two geographic areas (as a result of Theorem 2.21) and they are 
mutually exclusive since they are associated to specific intersection values. 


Descriptive Expression 

Intersection Value 

Relationship 

A is disjoint from B 

(0, 0,0,0) 

A n b -f- 0 

A touches B 

(1,0,0, 0) 

An b = dAndB 

A equals B 

(1,1,0, 0) 

A — B 

A is within B 

(1,1,1,0) or (0, 1,1,0) 

A c B 

A contains B 

(1, 1,0, 1) or (0,1,0,1) 

Ad b 

A overlaps B 

(1,1, 1,1) or (0, 1,1,1) 

Int(A) H Int(R) ^ 0, 
A(jLB, and B (jL A 


Recall the scenario described previously where a GIS user submits a re¬ 
quest for a display of all wetlands that lie partially or entirely within a state’s 
parklands. In response to the query, the GIS checks each pair of stored geo¬ 
graphic areas consisting of a wetland area A and a state parkland area B and 
returns all of those pairs for which A equals B, A is within B, A contains B, 
or A overlaps B. 

Exercises for Section 2.4 

2.29. Let A = [a\, 02 ] and B — [b\ , £> 2 ] be closed and bounded intervals in R. In 

each of the following groups determine which of the four possible intersection 

values can be realized and which cannot for A and B in R. Depict those that 

can be realized and prove that the remainder cannot. 

(a) (0,0,0,0), (1,0,0,0), (0,1,0,0), (1,1,0,0) 

(b) (0,0,1,0), (1,0,1,0), (0,1,1,0), (1,1,1,0) 

(c) (0,0,0, 1), (1,0,0,1), (0,1,0,1), (1,1,0,1) 

(d) (0,0,1,1), (1,0,1,1), (0,1,1,1), (1,1,1,1) 

2.30. Let A and B be closed sets in a topological space X. 

(a) Prove that if I a, b = (0,0,0,0), then A n B = 0. 

(b) Prove that if I a, b = (L 0,0,0), then A n B = dA H 3 B. 

(c) Prove that if Ia,b = (L 1,1,1) or (0,1,1,1), then Int(A) n Int(B) / 0, 
A <j/L B, and B ’<£ A. 
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2.31. This exercise demonstrates that if we drop either of the defining conditions for 
planar spatial regions A and B, then Ia,b = (1,1,0,0) need not imply A = B. 

(a) Find an example of regularly closed sets A and B in the plane such that 
Ia,b = (1,1,0,0)andA#B. 

(b) Find an example of closed sets A and B in the plane, each having an 
interior that is an open ball, such that Ia,b = (1,1,0, 0) and A ^ B. 
(In Chapter 6 we prove that an open ball in R 2 is connected, meaning it 
cannot be expressed as the union of two disjoint nonempty open subsets. 
Therefore a set with an interior that is an open ball satisfies the second 
condition to be a planar spatial region.) 

2.32. Prove that if U is open and B = Cl([/), then B is regularly closed. 

2.33. Prove that a closed rectangle [a,b] x [c, d] is regularly closed as a subset of 
the plane. 

2.34. Provide examples of pairs of closed sets in the plane that realize the eight 
intersection values not depicted in Figure 2.13. Use pairs of regularly closed 
sets wherever possible. 

2.35. (a) Provide an example demonstrating that dA need not equal 3(Int(A)). 

(b) Show that 3 A = 3(Int(A)) for regularly closed sets A. 



Chapter 3 

Creating New Topological 
Spaces 

In this chapter, we look at some methods for creating new topological spaces, 
given particular topological spaces with which to start. In Section 3.1, we show 
how a subset of a topological space inherits a topology from the space itself. 
In Section 3.2, we show how combining topological spaces by taking their 
product results in a new topological space, the product space. In Sections 3.3 
and 3.4, we show how to glue portions of a topological space together to obtain 
a new space called a quotient space. Finally, in Section 3.5, we introduce 
configuration spaces and phase spaces, and we examine examples of how they 
arise in the physical sciences. 

Over the course of the chapter we introduce a number of new topological 
spaces. We continue to use the expression “topologically equivalent” when 
describing spaces that are indistinguishable as far as topology is concerned. In 
Chapter 4 we formally define topological equivalence via functions known as 
homeomorphisms. 

3.1 The Subspace Topology 

Given a subset Y of a topological space X, there is a natural way to define a 
topology on Y, based on the topology on X. 

DEFINITION 3.1 . Let X be a topological space and let Y be a subset ofX. 
Define Ty = [U Df | U is open in X}. This is called the subspace topology 
on Y and, with this topology, Y is called a subspace of X. We say that V C Y 
is open in Y if V is an open set in the subspace topology on Y. 

Thus, a set is open in the subspace topology on Y if it is the intersection 
of an open set in X with Y . 

Of course, we need to verify that the subspace topology is actually a topol¬ 
ogy: 

(i) First, we note that 0 and Y are both open in Y, since 0 = 0 n T 
and Y = X (1 Y. 

(ii) Next, we show that intersections of finitely many open sets in Y 
are open in Y. Suppose that V \,... , V„ are open in Y. Then for 
each i there exists Ui, open in X, such that V,- = I/,- D Y. Now, 

Vi n... n v„ = (Ui n Y) n... n (u„ n Y) 

= (£/i n... n u„) n y. 

Since U\ n ... n U n is open in X, it follows that V\ IT ... n V„ is 
an open set in X intersected with Y, and therefore is open in Y. 


94 
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(iii) Finally, we show that unions of arbitrarily many open sets in Y are 
open in Y. Thus, suppose that {V a } is a collection of sets that are 
open in Y. Then for each a there exists an open set U a in X such 
that V a = U a fl Y. Therefore, 


\JV a = 0 Y) = (1J U «) 0 Y - 

Now, U U a is open in X, implying that 1J V a is an open set in X 
intersected with Y. Thus, |J V a is open in Y. 

EXAMPLE 3.1. Consider I = [0, 1] as a subset of R with the standard 
topology. In the subspace topology on /, open sets in / are open sets in R 
intersected with I. For example, sets of the form (a, b), where 0 < a < b < 1, 
are open in the subspace topology on I (and open in R as well). Also, for 
0 < a < 1, the sets [0, a) and (a, 1] are open in the subspace topology on /, 
even though they are not open in R. (See Figure 3.1.) 




1 -> k -» 
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[ 0 , 1 ] 
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FIGURE 3.1: The subspace topology on / = [0,1]. 


EXAMPLE 3.2. Consider the subspace topology that the integers Z inherit 
from the standard topology on R. (See Figure 3.2.) Since open intervals in 
R are open in the standard topology, and since each integer is contained in an 
open interval that contains no other integer, the single-point sets containing 
each integer are open sets in the subspace topology on Z. But then arbitrary 
unions of these sets are open. Hence, every subset of Z is open in Z, and the 
subspace topology on Z is the discrete topology. 



« « • *###*##* * * 


FIGURE 3.2: The topology that Z inherits from R. 
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In the previous two examples, we can view the subspace topologies inher¬ 
ited from the standard topology on R as the standard topology on the corre¬ 
sponding sets. More generally, we have the following definition: 


DEFINITION 3.2. Let Y be a subset ofW. The standard topology on Y is 
the topology that Y inherits as a subspace of R" with the standard topology. 


Thus the circle, the sphere, and, in general, the n-sphere all have a standard 
topology as subspaces of a Euclidean space. The same holds true for the disk, 
the n-ball, the open disk, and the open n-ball. 


EXAMPLE 3.3. Let Y = [—1,0) U (0, 1] C R. In the standard topology on 
Y, both [—1,0) and (0, 1] are open sets. Hence, their complements, which are 
(0, 1] and [—1,0), respectively, are closed sets in the standard topology on Y. 
Thus, in the standard topology on Y, the sets [—1,0) and (0, 1] are both open 
and closed. 


In Definition 3.1 we defined “open in T.” Next we discuss closed sets in a 
subspace topology. 


DEFINITION 3.3. Let X be a topological space, and let Y C X have the 
subspace topology. We say that a set C C Y is closed in Y if C is closed in the 
subspace topology on Y. 

What does it mean for C to be closed in T? As always, closed means that 
the complement is open, so a set C c Y is closed in Y if Y — C is open in Y. 
On the other hand, the following theorem indicates that we can obtain closed 
sets in Y by taking the intersection of closed sets in X with Y: 


THEOREM 3.4. Let X be a topological space, and let Y C X have the 
subspace topology. Then C C Y is closed in Y if and only if C = D F\Y for 
some closed set D in X. 

Proof. See Exercise 3.3. ■ 


We will find it useful to be able to produce a basis for the subspace topology 
on a subset of a topological space, using a basis for the original space. 

THEOREM 3.5. Let X be a topological space and B be a basis for the topol¬ 
ogy onX.IfYcX, then the collection 

By = {B n Y | B e B) 


is a basis for the subspace topology on Y. 
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Proof. First, note that By is a collection of open sets in the subspace 
topology on Y. We use Theorem 1.13 to prove that By is a basis for the 
subspace topology. Suppose W is an open set in the subspace topology 
on Y. Let y e W be arbitrary. Then W = U ft Y, where U is open 
in X. There exists a basis element B in B such that y e B C U. (See 
Figure 3.3.) Thus yeBr\YcUdY=W. Since B D Y e By, it now 
follows by Theorem 1.13 that By is a basis for the subspace topology 
on Y. ■ 



FIGURE 3.3: Inheriting basis elements for the subspace topology. 


EXAMPLE 3.4. Consider the circle S 1 C R 2 with the standard topology. 
Because open balls form a basis for the standard topology on IK 2 , their inter¬ 
section with S' forms a basis for the standard topology on S 1 . The resulting 
basis elements are open intervals in the circle, consisting of all points between 
two angles in the circle. (See Figure 3.4.) 




FIGURE 3.4: Basis elements for S 1 are open intervals in the circle. 


IMPORTANT NOTE: When we refer to “the circle” as a topological space, 
we mean S 1 with the standard topology. When we say “a circle” we mean a 
space topologically equivalent to the circle. The same holds for ‘‘the disk” 
versus “a disk,” “the sphere” versus “a sphere,” and so on. 
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EXAMPLE 3.5. In R 3 , let C be the circle of radius 1 in the xz-plane with 
center at the point (2,0,0). Consider the subspace of R 3 swept out as C is 
rotated about the z-axis. (See Figure 3.5.) The resulting space is called the 
torus and is denoted by T. 



_i 

-! 

EXAMPLE 3.6. If S is a surface in R 3 , then the collection of open patches in 
S, obtained by intersecting open balls in R 3 with S, is a basis for the standard 
topology on S. (See Figure 3.6.) 


/ 

/ 


FIGURE 3.6: Open patches on the surface are basis elements for the standard 
topology. 
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Exercises for Section 3.1 

3.1. Let X = {(*, 0) e R 2 | x e R}, the x-axis in the plane. Describe the topology 
that X inherits as a subspace of R 2 with the standard topology. 
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3.2. Let Y — [— 1,1] have the standard topology. Which of the following sets are 
open in Y and which are open in R? 

A = (-1,-1/2) U (1/2,1) 

B = (—1, —1/2] U [1/2,1) 

C = [—1, —1/2) U (1/2, 1] 

D = [—1, —1/2] U [1/2, 1] 

£=ir=.(T±^> 

3.3. Prove Theorem 3.4: Let X be a topological space, and let Y c X have the 
subspace topology. Then C c Y is closed in Y if and only if C = D Pi Y for 
some closed set D in X. 

3.4. Let Y = (0,5]. Which of the following subsets of Y are open, and which are 
closed, in Y in the standard topology? 

(a) (0,1) (b) (0,1] (c) {1} (d) (0,5] (e) (1,2) 

(f) [1,2) (g) (1,2] (h) [1,2] (i) (4,5] (J) [4,5] 

3.5. Let Y = (0,5] have the subspace topology inherited from R with the lower 
limit topology. Which of the following subsets of Y are open, and which are 
closed, in Y in this topology? 

(a) (0,1) (b) (0,1] (c) {1} (d) (0,5] (e) (1,2) 

(f) [1.2) (g)(l,2] (h) [1,2] (i) (4,5] U) [4,5] 

3.6. Let Y = (0,4] U {5}. Which of the following subsets of T are open, and which 
are closed, in Y in the standard topology? 

(a) (0,1) (b) (0,1] (c) (1) (d) (0,4] (e) (1,2) 

(0 [1,4) (g) (L4] (h) [1,4] (i) {4} (j) {4,5} 

3.7. Let X be a Hausdorff topological space, and Y be a subset of X. Prove that the 
subspace topology on Y is Hausdorff. 

3.8. Let X be a topological space, and let Y c X have the subspace topology. 

(a) If A is open in Y, and Y is open in X, show that A is open in X. 

(b) If A is closed in Y, and Y is closed in X, show that A is closed in X. 

3.9. (a) Let X = {ieR|neZ+}. Show that the standard topology on K is the 

discrete topology. 

(b) Let K* = K U {0}. Show that the standard topology on K* is not the 
discrete topology. 

.10. Show that the standard topology on Q, the set of rational numbers, is not the 
discrete topology. 

.11. Let A be a subspace of X, and D c A. By Int^D and Int*Z) we mean, 
respectively, the interior of D in the subspace topology on A and the interior of 
D in the topology on X. Similarly, we define Cl aD and C1*D for the closure, 
and we define 3 aD and 3* D for the boundary. 

(a) Show that Int/iD c Afl Int*D, and provide an example showing that 
equality does not hold in general. 

(b) Explore the relationship between CU D and A n Cl* D . For each contain¬ 
ment c and D, either prove that it holds or find a counterexample. 

(c) Explore the relationship between 3^ D and A fl 3* D. For each containment 
C and D, either prove that it holds or find a counterexample. 
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3.2 The Product Topology 

Given two topological spaces X and Y, we would like to generate a natural 
topology on the product, X x Y. Our first inclination might be to take as the 
topology on X x Y the collection C of sets of the form U x V where U is open 
in X and V is open in Y. But C is not a topology since the union of two sets 
U\ x V\ and U 2 x V 2 need not be in the form U x V for some U c X and 
V c F. (See Figure 3.7.) However, if we use C as a basis, rather than as the 
whole topology, we can proceed. 


Y 



FIGURE 3.7: The union of the sets U\ x V\ and U 2 x V 2 need not be in the form 
U x V. 

DEFINITION 3.6. Let X and Y be topological spaces and X x Y be their 
product. The product topology on X x Y is the topology generated by the 
basis 

B = {U x V | U is open in X and V is open in F). 

Of course, we must verify that B actually is a basis for a topology on the 
product, X x Y. 

THEOREM 3.7. The collection B is a basis for a topology on X x Y. 

Proof. Every point (x, y) is in X x Y, and X x Y e B. Therefore the 
first condition for a basis is satisfied. 

Next assume that (x, y) is in the intersection of two elements of B. 
That is, (x, y) 6 (U\ x Vj) fl (t /2 x V 2 ) where U\ and t / 2 are open 
sets in X, and V\ and V 2 are open sets in Y. Let C /3 = U\ fl t / 2 and 
V 3 = V\ fl V 2 . Then t /3 is open in X, and V 3 is open in Y, and therefore 
lh x F 3 € B. Also, 

u 3 x v 3 = (Ui n u 2 ) x (v, n v 2 ) = {u x x v x ) n (U 2 x v 2 ), 

and thus (x, y) e t / 3 x V 3 c (t/i x Vi) fl (t / 2 x V 2 ). It follows that the 
second condition for a basis is satisfied. 

Therefore B is a basis for a topology on X x Y. ■ 
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EXAMPLE 3.7. Let X = {a,b, c) and Y = {1,2} with topologies 
{0, {b}, {c}, {a, b }, {b, c), X} and {0, {1}, F}, respectively. A basis for the 
product topology on X x Y is depicted in Figure 3.8. Each nonempty open set 
in the product topology on X x Y is a union of the basis elements shown. 
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FIGURE 3.8: A basis for the product topology on X x Y. 

IMPORTANT NOTE: As we indicated at the outset of this section, it is not 
only the products of open sets that are open sets in the product topology. Those 
sets, along with all possible unions of them, are what make up the product 
topology. 

As with open sets, products of closed sets are closed sets in the product 
topology. (See Exercise 3.19.) But here too, this does not account for all of the 
closed sets because there are closed sets in the product topology that cannot be 
expressed as a product of closed sets. For instance, the set {(a, 2), (c, 1), (c, 2)} 
is a closed set in the product topology in Example 3.7, but it is not a product 
of closed sets. 

In Definition 3.6, the basis B that we use to define the product topology 
is relatively large since we obtain it by pairing up every open set U in X with 
every open set V in Y. Fortunately, as the next theorem indicates, we can find 
a smaller basis for the product topology by using bases for the topologies on 
X and Y, rather than using the whole topologies themselves. 

THEOREM 3.8. IfC is a basis for X and V is a basis for Y, then 

£ = {CxD\CsC and D € V} 

is a basis that generates the product topology on X x Y. 

Proof. Each set C x D e £ is an open set in the product topology; 
therefore, by Theorem 1.13, it suffices to show that for every open set W 
in X x Y and every point (x, y) e W, there is a set C x D in £ such that 
(x, y) e C x D c W. But since W is open in X x Y, we know that there 
are open sets t/in X and V in Y such that (x, y) e U xV c W. Sox e U 
and y e V. Since U is open in X, there is a basis element C € C such 
thatx e C c U. Similarly, since V is open in Y, there is a basis element 
D e V such that y € D c V. Thus (x,y)eCxDcUxVcW. 
Hence, by Theorem 1.13, it follows that £ = {CxD|CeC and D e T>] 
is a basis for the product topology on X x Y. ■ 
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EXAMPLE 3.8. Let / = [0, 1] have the standardropology as a subspace of 
E. The product space I x I is called the unit square. (See Figure 3.9.) The 
product topology on / x I is the same as the standard topology on / x I as a 
subspace of E 2 . (See Theorem 3.9.) 
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FIGURE 3.9: The unit square, I x I. 


IMPORTANT NOTE: When picturing a product space, we can imagine a 
copy of the second space corresponding to each point in the first space, or vice 
versa. 


EXAMPLE 3.9. Let S 1 be the circle, and let I = [0,1] have the standard 
topology. Then S 1 x / appears as in Figure 3.10. We can think of it as a 
circle with intervals perpendicular at each point of the circle. Seen this way, 
it is a circle’s worth of intervals. Or it can be thought of as an interval with 
perpendicular circles at each point. Thus it is an interval’s worth of circles. 
The resulting topological space is called the annulus. 

The product space S 1 x (0,1) is the annulus with the innermost and out¬ 
ermost circles removed. We refer to it as the open annulus. 




FIGURE 3.10: Depicting S 1 x I. 
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EXAMPLE 3.10. Consider the product space 5 1 x S 1 , where S 1 is the circle. 
For each point in the first S l , there is a circle corresponding to the second S 1 . 
(See Figure 3.11.) Since each S 1 has a topology generated by open intervals 
in the circle, it follows by Theorem 3.8 that S 1 x S 1 has a basis consisting of 
rectangular open patches, as shown in the figure. The resulting space resembles 
the torus introduced in Example 3.5; in fact, they are topologically equivalent. 



EXAMPLE 3.11 . Let D be the disk as a subspace of the plane. The product 
space S 1 x D is called the solid torus. (See Figure 3.12.) If we think of the 
torus as the surface of a doughnut, then the solid torus is the whole doughnut 
itself. 



FIGURE 3.12: The product S 1 x D is the solid torus. 


Let A and B be subsets of topological spaces X and Y, respectively. We 
now have two natural ways to put a topology on A x B. On the one hand, we 
can view A x B as a subspace of the product X x Y. On the other hand, we 
can view A x B as the product of subspaces, A C X and B C Y. The next 
theorem indicates that both approaches result in the same topology. 
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THEOREM 3.9. Let X and Y be topological spaces, and assume that A C X 
and B C Y. Then the topology on Ax B as a subspace of the product X x Y is 
the same as the product topology onAxB, where A has the subspace topology 
inherited from X, and B has the subspace topology inherited from Y. 

Proof. See Exercise 3.15. ■ 


The approach used to define a product of two spaces extends to a product 
X\ x ... x X„ of n topological spaces. It is straightforward to see that the 
collection 


B = {U\ x ... x U n | Ui is open in X, for each i} 

is a basis for a topology on Xi x ... x X n . The resulting topology is called 
the product topology on X\ x ... x X n . We have an analog to Theorem 3.8 
for this case. Specifically, if B, is a basis for X; for each i = 1,... , n, then 
the collection 


B' = (Z?i x ... x B„ | Bi € Bi for / = 1,... , n} 
is a basis for Xi x ... x X„. 

Given topological spaces X, Y, and Z, the space (X x Y) x Z is the 
product of the spaces XxF and Z, the space X x (7 x Z) is the product 
of the spaces X and Y x Z, and the space X x Y x Z is the product of the 
three spaces X, y, and Z together, as described above. These three spaces 
are naturally topologically equivalent. (See Exercise 4.34.) Therefore, we do 
not distinguish between these spaces, and we always use the latter notation, 
without grouping, to denote a product of multiple spaces. 

In Section 1.2 we defined the standard topology on R" to be the topology 
generated by the basis of open balls defined by the Euclidean distance formula 
on R n . We also pointed out in Section 1.2 that the same topology results from 
taking a basis made up of products of open intervals in R. It follows that the 
standard topology on R” is the same as the product topology that results from 
taking the product of n copies of R with the standard topology. 


EXAMPLE3.12. Then-torus, T n , is the topological space obtained by taking 
the product of n copies of the circle, S 1 . 

_i 


The next theorem indicates that, nicely enough, the interior of a product is 
the product of the interiors. 

THEOREM 3.10. Let A and B be subsets of topological spaces X and Y, 
respectively. Then Int(A x B) = Int(A) x Int(B). 

Note that in the statement of the theorem, Int() is taking the interior in 
three separate topological spaces—Int(A x B) is the interior in X x Y, Int(A) 
is the interior in X, and Int(J3) is the interior in Y. 
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Proof. Since Int(A) is an open set contained in A, and Int(fi) is an open 
set contained in B, it follows that Int(A) x Int(B) is an open set in the 
product topology and is contained in A x B. Thus Int(A) x Int(jEf) c 
Int(A x B). 

Now suppose (x, y) e Int(A x B). We prove that (x, y) e 
Int(A) x Int(fi). Since (x, y) e Int(A x B), it follows that (x, y) is 
contained in an open set contained in A x B and therefore is also con¬ 
tained in a basis element contained in A x B. So there exists a U and 
V, open in X and Y, respectively, such that (x, y) e U x V C A x B. 
Thus, x is in an open set U contained in A, and y is in an open set V 
contained in B, implying that x € Int(A) and y e Int(15). Therefore 
(x, y) € Int(A) x Int(B). It follows thatInt(A x B) c Int(A) x Int(B). 

Since we have both Int(A) x Int(fi) c Int(A x B) andInt(A x B) C 
Int(A) x Int(B), the desired equality follows. ■ 

Results parallel to Theorem 3.10, for the closure and boundary of a product 
of sets, are considered in Exercises 3.20 and 3.22. 

Exercises for Section 3.2 

3.12. Is the finite complement topology on R 2 the same as the product topology on 
R 2 that results from taking the product R/ c xM; c , where R/ f is R in the finite 
complement topology? Justify your answer. 

3.13. Let X = PPR 2 0 0) , the particular point topology on R 2 with the origin serving 
as the particular point. (See Exercise 1.7.) Is X the same as the topology that 
results from taking the product of R with itself, where each R has the particular 
point topology PPRo? Justify your answer. 

3.14. Prove that the digital plane introduced in Section 1.4 is the topological space 
that results from taking the product of two copies of the digital line. 

3.15. Prove Theorem 3.9: Let X and Y be topological spaces, and assume that 
A c X and B c Y. Then the topology on A x B as a subspace of the product 
X x Y is the same as the product topology on A x B, where A has the subspace 
topology inherited from X, and B has the subspace topology inherited from Y. 

3.16. Let S 2 be the sphere, D be the disk, T be the torus, S l be the circle, and 
I — [0,1] with the standard topology. Draw pictures of the product spaces 
S 2 x /, T x /, 5* x / x /, and S l x D. 

3.17. If L is a line in the plane, describe the subspace topology it inherits from R/ x R 
and from R; xRj, where R; is the real line in the lower limit topology. Note 
that the result depends on the slope of the line. In all cases, it is a familiar 
topology. 

3.18. Show that if X and Y are Hausdorff spaces, then so is the product space Xxf. 

3.19. Show that if A is closed in X and B is closed in Y, then A x B is closed in 
X x Y. 

3.20. Show that if A c X and B c Y, then C1(A x B) = C1(A) x Cl(fl). 

3.21. Determine whether or not the sets in Figure 3.13 are open, closed, both, or 
neither in the product topologies on the plane given by R x R, R/ x R, and 
R/ xl|, where R/ is the real line in the lower limit topology. 
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FIGURE 3.13: Are these sets open, closed, both, or neither in R x M, 1/ x 
R, and R; xR/? 


3.22. Suppose that A c X and B c Y. 

(a) Provide an example demonstrating that 3(A x B) — 3(A) x 3(B) does 
not hold in general. 

(b) Derive and prove a relationship expressing 3(A x B) in terms of 3(A), 
3(B), A, and B. 


3.3 The Quotient Topology 

The concept of a quotient topology allows us to construct a variety of additional 
topological spaces from the ones that we have already introduced. Put simply, 
we create a topological model that mimics the process of gluing together or 
collapsing parts of one or more objects. One of the most well-known examples 
is the torus, as obtained from a square sheet by gluing together the opposite 
edges. (See Figure 3.14.) 



FIGURE 3.14: Gluing together opposite edges of a square to obtain the torus. 
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The quotient topology is defined as follows: 

DEFINITION 3.11. Let X be a topological space and Abe a set (that is not 
necessarily a subset of X). Let p : X -> A be a surjective map. Define a 
subset U of A to be open in A if and only if p~ ] (U) is open in X. The resultant 
collection of open sets in A is called the quotient topology induced by p , and 
ie function p is called a quotient map. The topological space A is called a 
, uotient space. 

It might not be clear how this definition of a topology enables us to glue 
the opposite edges of a square to obtain a torus, but we will see this in detail 
in Example 3.20. First, we verify that the quotient topology is a topology: 

THEOREM 3.12. Let p : X -* A be a quotient map. The quotient topology 
on A induced by p is a topology. 

Proof. We verify each of the three conditions for a topology. 

(i) The set p~' (0) = 0, which is open in X. The set p~ x (A) = 
X, which is open in X. So 0 and A are open in the quotient 
topology. 

(ii) Suppose each of the sets in the collection {Up}p € B is open in 
the quotient topology on A. Then p“ 1 ((J(^)) = U P -1 (^), 
which is a union of open sets in X, and therefore is open in 
X. Thus, \J(Up) is °P en in the quotient topology, implying 
that the arbitrary union of open sets in the quotient topology 
is an open set in the quotient topology. 

(iii) Suppose each of the sets [/,-,/ = 1,.... n, is open in the 
quotient topology on A. Then /? _1 (p| Ui) = Pi 

which is a finite intersection of open sets in X, and therefore 
is open in X. Hence, p) I/,- is open in the quotient topology, 
and it follows that the finite intersection of open sets in the 
quotient topology is an open set in the quotient topology. 

Hence, the quotient topology is a topology on A. ■ 


-, 

EXAMPLE 3.13. Give R the standard topology, and define 


p : R —» {a, b, c} by p(x) = 


a if x < 0, 
b if x = 0, 
c if x > 0. 


The resulting quotient topology on {a, b, c } is depicted in Figure 3.15. The 
subsets {a}, {c}, and {a, c} are all open since their preimages are open in R. 
But {b} is not open since its preimage is {0}, which is not open in R. 
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P 

-M- 

P~\a) p~\b ) p~\c) 

a b c 

FIGURE 3.15: The quotient topology on {a, b, c} induced by the function p. 



j 


EXAMPLE 3.14. Let R have the standard topology, and define p : 1R -> Z 
by p(x) = x if x is an integer, and p(x) = n if* e (n — 1, n + 1) and n is 
an odd integer. (See Figure 3.16.) So p is the identity on the integers, and p 
maps noninteger values to the nearest odd integer. 



P 


• • • • • Z 

-2-10 1 2 

FIGURE 3.16: The function p : R -► Z. 

In the resulting quotient topology on Z, if n is an odd integer, then {n} is 
an open set since p _1 ({«}) = (n — 1, n + 1), an open set in R. If n is an even 
integer, then {«} is not an open set since p~ l ({«}) is not open in R. In the 
quotient topology, the smallest open set containing an even integer n is the set 
{n - l,n,n + 1 }. 

It follows that the quotient topology induced by p on Z is the digital line 
topology introduced in Example 1.10. This quotient mapping demonstrates a 
natural passage from the line to a discrete model of it, an idea that is central in 
digital image processing. (See Sections 1.4 and 11.3.) The line is subdivided 
into open pixels—the intervals {n — 1, n + 1), for n odd—along with the 
boundaries between the pixels—the points n, for n even. Then, in the quotient 
space the pixels become the single-point open sets {n}, for n odd, and the 
boundaries between the pixels become the single-point closed sets { n }, for n 
even. 


Let X be a topological space. We are particularly interested in quotient 
spaces defined on partitions of X. Specifically, let X* be a collection of mu¬ 
tually disjoint subsets of X whose union is X, and let p : X -* X* be the 
surjective map that takes each point in X to the corresponding element of X* 
that contains it. Then p induces a quotient topology on X*. We think of the 
process of going from the topology on X to the quotient topology on X* as 
taking each subset S in the partition and identifying all of the points in S with 
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one another, thereby collapsing 5 to a single point in the quotient space. (See 
Figure 3.17.) A set U of points in X* is open in the quotient topology on X* 
exactly when the union of the subsets of X, corresponding to the points in U, 
is an open subset in X. 


X 

[ -*- X -*- 3 

A B C D E 



X* 


A B C D E 


FIGURE 3.17: Collapsing each set in the partition to a point in the quotient space. 


EXAMPLE 3.15. In Figure 3.18 we illustrate a quotient space defined on a 
partition. Let X = {a, b, c, d, e } with topology 

{0, {a}, {a, b], {a, b, c }, {a, b, c, d }, X}. 

With A = { a,b} and B = [c, d, e], let X* be the partition of X given by 
X* = {A, 5}. Note that X* is a two-point set. Since {a, b] is open in X and 
(c,d,e) is not, the only open sets in the quotient topology on X* are 0, {A}, 
and X* itself. 



X* 


C OO 

A B 


FIGURE 3.18: The space X and the quotient space determined by the partition 
X* = {A,B}. 


EXAMPLE 3.16. Let X = [0, 1], and consider the partition X* that is made 
up of the single-point sets {*}, for 0 < x < 1, and the double-point set 
D = {0, 1}. (See Figure 3.19.) Then, in the quotient topology on X*, we think 
of D as a single point, as if we had glued the two endpoints of [0,1] together. 

A subset of X* that does not contain D is a collection of single-point 
subsets, and it is open in X* exactly when the union of those single-point sets 
is an open subset of (0, 1). A subset of X* that contains D is open in X* when 
the union of all the sets making up the subset is an open subset of [0, 1], Such 
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an open subset must contain 0 and 1, and therefore must contain intervals [0, a) 
and ( b , 1J, which are open in the subspace topology on [0,1]. The resulting 
space is topologically equivalent to the circle, S 1 . 



FIGURE 3.19: Forming a quotient topology on [0, 1] by gluing the point 0 to the point 
1 . 


EXAMPLE 3.17. In the previous example, we glued the endpoints of an 
interval together to obtain a single point. That is an example of a more general 
construction that results in a space known as a topological graph. Specifically, 
a topological graph G is a quotient space constructed by taking a finite set 
of points, called the vertices of G, along with a finite set of mutually disjoint 
closed bounded intervals in R, and gluing the endpoints of the intervals to the 
vertices in some fashion. (See Figure 3.20.) The glued intervals are called the 
edges of G. 

We examine topological graphs and their role in graph theory further in 
Chapter 13. 




FIGURE 3.20: A topological graph is obtained by gluing endpoints of intervals to a 
set of vertices. 


EXAMPLE 3.18. In Example 3.16 we obtained a circle by identifying end¬ 
points of an interval in the real line. We describe a similar process here, using 
the digital line, that yields spaces we call digital circles. 

Specifically, a digital interval is a subset [m,m + 1,... , n} of Z with 
the subspace topology inherited from the digital line topology. Let l n be the 
digital interval in the form [1, 2,... , n — 1,«}. If n > 5 is an odd integer, 
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then the topological space C„_i resulting from identifying the endpoints 1 and 
n in /„ is called a digital circle. In Figure 3.21 we illustrate I-j and C(, along 
with a basis for each. By definition, a digital circle contains an even number 
of points. 



FIGURE 3.21: The digital circle C$ is a quotient space of the digital interval Ij. 


In Exercise 3.31 we examine the topological spaces that result when we 
take a general digital interval [m,m + \,... ,n] and identify the endpoints m 
and n. Digital circles arise only when both m and n are odd. 


EXAMPLE 3.19. Here we look at two different quotient spaces defined on 
/ x • /. In the first case, we define a partition by taking subsets of the following 
form: 


(i) A Xi y = {(x, y)} for every x and y such that 0 < x < 1 and 
0 < y < 1. 

(ii) By = {(0, y), (1, y)} for every y such that 0 < y < 1 . 

In the quotient topology, the subsets B y cause the left and right edges of the 
square to be glued as shown in Figure 3.22. The result is a space that is 
topologically equivalent to the annulus. 









FIGURE 3.22: Obtaining the annulus by gluing the left and right edges of a square. 
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In the second case, we define a partition by taking subsets of the following 
form: 


(i) A x , y = {(x, y)} for every x and y such that 0 < x < 1 and 
0 < y < 1. 

(ii) B* = {(0, y), (1, 1 — y)} for every y such that 0 < y < 1. 

Here the subsets B* also cause the left and right edges of the square to be glued. 
But, as we see in Figure 3.23, in order to accomplish the gluing, we need to 
perform a half twist so that the identified points on the edges can be properly 
brought together. The result is the well-known Mobius band. 



FIGURE 3.23: Obtaining the Mobius band by gluing the left and right edges of a 
square. 


j 


In situations similar to Example 3.19, we denote the gluings that we do by 
putting direction arrows on each of a pair of edges whose points are identified. 
Then we match the points on the two edges in a one-to-one fashion so that the 
directions on the glued edges coincide. 


-, 

EXAMPLE 3.20. Define a partition of / x I by taking subsets of the following 
form: 


(i) A Xj y = {(x, y)} for every x and y such that 0 < x < 1 and 
0 < y < 1. 

(ii) By = {(0, y), (1, y)} for every y such that 0 < y < 1 . 

(iii) C x = {(x, 0), (x, 1)} for every x such that 0 < x < 1. 

(iv) D = {(0,0), (0,1), (1,0), (1,1)}. 

In the quotient topology, the two-point subsets in (ii) cause the gluing of the 
left edge of the square to the right edge, and the two-point subsets in (iii) cause 
the gluing of the top edge of the square to the bottom edge. (See Figure 3.24.) 
Furthermore, the four-point subset causes the gluing of the four comers of the 
square to a single point. The topological space we obtain is therefore the result 
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of taking a square and gluing together its opposite edges. As shown previously 
in Figure 3.14, such a construction results in a torus. 



D 


flv 


D 





FIGURE 3.24: Partitioning the square to obtain a torus. 


In Figure 3.24, even though we do not explicitly show that the four corners 
of the square are identified, note that the gluing of the edges labeled 1 and the 
gluing of the edges labeled 2 identify all four comers with each other. 

An advantage to this representation of the torus is that we can imagine 
what it is like to move around on the toms sitting in 3-space, without ever 
leaving the flat two-dimensional representation that keeps track of the gluings. 
Each time we reach a point on the top edge of the square, we are transported 
to the corresponding point on the bottom edge. This process allows us to view 
a space in a setting of one dimension less than the setting in which it naturally 
resides. We will find this approach to viewing a space particularly useful when 
visualizing higher-dimensional spaces in Chapter 14. 

This idea of a square with opposite edges identified occurs in some com¬ 
puter games. If we are moving around on the screen and we go off the left 
edge of the screen, we reappear at the same height on the right edge. If we go 
off the top edge, we reappear on the bottom edge. So, in fact, these games are 
actually being played on the toms. 

In the next section, we examine more examples of spaces obtained by 
identifying points on the edges of polygons, and we examine a few three- 
dimensional examples obtained by gluing faces of a cube. 


Exercises for Section 3.3 

3.23. If R has the standard topology, define 


p : R -»■ {a, b, c, d, e } by p(x) = 


a if x > 2, 
b if x = 2, 
c if 0 < x <2, 
d if — 1 < x < 0, 
e if x < — 1. 


(a) List the open sets in the quotient topology on {a, b, c, d, e}. 

(b) Now assume that R has the lower limit topology. What are the open sets 
in the resulting quotient topology on {a, b, c, d, e}? 
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3.24. Let X = R in the standard topology. Take the partition 

X* = {... ,(-l,0],(0, 1], (1,2],...}. 

Describe the open sets in the resulting quotient topology on X*. 

3.25. Define a partition of X = R 2 - {O} by taking each ray emanating from the 
origin as an element in the partition. (See Figure 3.25.) Which topological space 
that we have previously encountered appears to be topologically equivalent to 
the quotient space that results from this partition? 



FIGURE 3.25: A partition of X = R 2 - {O}. 

3.26. Devise the rules for a game of tic-tac-toe that is played on the surface of the 
torus, using the square model of the torus as illustrated in Figure 3.26. What 
new three-in-a-rows would this game allow that are not allowed in the standard 
game? 


X 

— rr — 

X 



0 

J 


_Sk— 

0 


FIGURE 3.26: Tic-tac-toe on a torus. 


3.27. Provide an example showing that a quotient space of a Hausdorff space need 
not be a Hausdorff space. 

3.28. Consider the equivalence relation on R defined by x ~ yifx — y 6 Z. Describe 
the quotient space that results from the partition of K into the equivalence classes 
in the equivalence relation. 

3.29. Consider the equivalence relation on R 2 defined by (x \, xi) ~ (w\, wt) if 
x\ + X 2 — wi + W 2 - Describe the quotient space that results from the partition 
of R 2 into the equivalence classes in this equivalence relation. 
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3.30. Consider the equivalence relation on R 2 defined by (x \, x-£) ~ (u>i, W 2 ) if 
x\ + x\ = w\ + w\. Describe the quotient space that results from the partition 
of M 2 into the equivalence classes in this equivalence relation. 

3.31. Consider the four cases of m and n being either even or odd, and describe the 
topological spaces that result when we identify the endpoints m and n in a 
general digital interval {m, m + 1,... , n}. 

3.32. (a) Show that the digital line can be obtained as a quotient space that results 

from a partition of R in the standard topology. 

(b) Show that the digital plane, introduced in Section 1.4, can be obtained as a 
quotient space that results from a partition of R 2 in the standard topology. 

3.33. In each of the following cases, describe or draw a picture of the resulting quotient 
space. Assume that points are identified only with themselves unless they are 
explicitly said to be identified with other points. 

(a) The disk with its boundary points identified with each other to form a 
single point. 

(b) The circle S 1 with each pair of antipodal points identified with each other. 

(c) The interval [0,4], as a subspace of R, with integer points identified with 
each other. 

(d) The interval [0,9], as a subspace of R, with even integer points identified 
with each other to form a point and with odd integer points identified with 
each other to form a different point. 

(e) The real line R with [—1,1] collapsed to a point. 

(f) The real line R with [-2, — 1] U [ 1,2] collapsed to a point. 

(g) The real line R with (-1, 1) collapsed to a point. 

(h) The plane R 2 with the circle S l collapsed to a point. 

(i) The plane R 2 with the circle S l and the origin collapsed to a point. 

(j) The sphere with the north and south pole identified with each other. 

(k) The sphere with the equator collapsed to a point. 

3.34. An optical character-recognition program attempts to recognize a character 
(for example, a letter of the alphabet) from a computer image of it. Topological 
properties of each letter’s shape can help in this identification. Create a topo¬ 
logical graph representation of each letter of the alphabet, and in each case use 
a topological graph with as few edges as possible. 


3.4 More Examples of Quotient Spaces 

We ended the previous section with a description of how to obtain a torus 
by gluing together opposite edges in a square. In this section, we examine 
other quotient spaces obtained by gluing together parts of familiar geometric 
figures. These are all examples of topological spaces known as manifolds. We 
investigate manifolds and some of the other ideas introduced in this section in 
more detail in Chapter 14. 


EXAMPLE 3.21. Identify the opposite edges of a square, with the left and 
right edges glued as in Example 3.20, but now with the top edge glued to the 
bottom edge so that the left end of the top edge glues to the right end of the 
bottom edge, and the right end of the top edge glues to the left end of the bottom 
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edge. In other words, the top edge is flipped before it is glued to the bottom 
edge. (See Figure 3.27.) The resulting topological space is called the Klein 
bottle, named for the mathematician Felix Klein (1849-1925). 



FIGURE 3.27: Gluing the edges of a square to obtain a representation of the Klein 
bottle. 


In Figure 3.27, we show how to glue together a Klein bottle. We first 
glue the left and right edges of the square to create a cylinder. Now, when we 
attempt to glue the top circle of the cylinder to the bottom circle by bending the 
cylinder as we did for the torus, we find that the arrows on the two circles do not 
match. The only way to successfully match the arrows in 3-space is to allow 
the cylinder to pass through itself as if it were a soap film instead of a solid sheet 
of rubber. Then we can successfully glue the two circles together so that the 
arrows match. But of course, the actual Klein bottle does not have a circle of 
intersection with itself as this object does. In fact, the actual Klein bottle does 
not exist in 3-space. It can be constructed in 4-space without self-intersection. 
(See Chapter 11.) In 3-space, this representation with self-intersection is the 
best we can do. 


EXAMPLE 3.22. Next we consider two more surfaces obtained as quotient 
spaces—one familiar, the other new and somewhat unusual. In both cases, we 
take the disk and divide its edge into two semicircles that we glue together. 
(See Figure 3.28 and 3.29.) 

First, consider the quotient space obtained by the gluing shown in Fig¬ 
ure 3.28. In that case, we are essentially zipping up a change purse to obtain a 
surface that is topologically equivalent to the sphere. 
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FIGURE 3.28: Gluing the boundary of a disk to obtain a sphere 


Now consider the quotient space obtained by the gluing shown in Fig¬ 
ure 3.29. The resulting topological space is known as the projective plane, 
denoted P. As in the Klein bottle example, we cannot construct the projective 
plane in 3-space. The best we can do to visualize the projective plane is to 
create a representation of it in 3-space, allowing self-intersection. 



FIGURE 3.29: Gluing the boundary of a disk to obtain a representation of the 
projective plane. 


In the illustrated construction of P, we begin by gluing together the points 
labeled p. The result is a bowl pinched along its rim. There are now two circles 
that need to be glued, but, unfortunately, as we bring them toward each other, 
we see that their arrows do not match properly, and we cannot directly glue 
them. We can, however, glue the back half of the left circle to the front half of 
the right circle, as we have done in the second-to-last stage in the illustration, 
leaving the other half of each circle unglued. To finish the gluing, we need to 














118 Chapter 3 Creating New Topological Spaces 


pass part of the surface through itself. In the last stage in the illustration, we glue 
the remaining half circles together, and we depict it as if it is happening along 
the same vertical segment where the other two half circles are glued together. 
What is really happening, though, is that the surface is passing through itself 
along that segment so that the gluings are separate. 


EXAMPLE 3.23. Consider the topological space obtained by identifying an¬ 
tipodal points (each pair of points x and —x) on the sphere. We show that the 
result is topologically equivalent to the projective plane. 

With the equator as shown in Figure 3.30, note the following: 

(i) Every point on the open northern hemisphere is identified with its 
antipode on the open southern hemisphere. 

(ii) Every point on the equator is identified with its antipode on the 


r, ... 

l - •' 5 

'• fy f- ? k 


FIGURE 3.30: Identifying antipodal points on the sphere results in a projective plane. 


We may remove the open northern hemisphere, rotate it, invert it, and 
then glue it directly onto the southern hemisphere as shown. Consequently, 
what remains is a bowl whose rim is glued together according to the arrows 
in Figure 3.30. That is equivalent to having a disk with the edge semicircles 
glued together as in Figure 3.29. Thus, the resulting space is topologically 
equivalent to the projective plane, as claimed. 
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Given two surfaces Si and S 2 , there is a natural way to adjoin them to obtain 
a new surface. We describe it here, but leave the definition for Chapter 14. From 
both Si and S 2 , remove the interior of a disk, leaving a hole surrounded by 
a circle. Then glue the two surrounding circles together via a quotient map. 
(See Figure 3.31.) The result is called the connected sum of Si and S 2 and is 
denoted Si#S 2 . 



FIGURE 3.31: The connected sum of two surfaces. 


EXAMPLE 3.24. In Figure 3.32 we show the surface, T # T, that results from 
taking the connected sum of two tori. 



FIGURE 3 32: The connected sum of two tori. 


By a process referred to as cutting and pasting we can obtain a repre¬ 
sentation of T # T as a polygon with edges glued together as illustrated in 
Figure 3.33. We start with two versions of the torus, represented as squares 
with edges glued together as in (a). Then we remove the interior of a disk from 
each as shown in (b). We choose those disks so that the boundary circle of each 
touches the vertex on the top edge and the right side edge of the corresponding 
square. This is for our convenience. Now, we open up those circles and stretch 
the squares out to pentagons, as shown in (c). For each of the resulting new 
edges, the two endpoints are still identified with each other, once we do all of 
the gluings on the edges. So in reality, each of these edges remains a circle in 
each surface. Next, we glue these two new edges together. This is the equiva¬ 
lent of taking the connected sum of two tori, since we are gluing together the 
boundaries of the two disks whose interiors we removed. Thus, in (d) we have 
T#T, represented as an octagon with pairs of edges glued together. 


EXAMPLE 3.25. What do we obtain when we take the connected sum of two 
projective planes? In Figure 3.34, through cutting and pasting, we See that at 
step (e), P#P can be represented as a square with its edges glued together. If we 



FIGURE 3.34: The connected sum of two projective planes is topologically equiva¬ 
lent to a Klein bottle. 
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cut across this square as in (f) and glue the edges labeled 2, then the result is a 
square with its edges glued together to yield a Klein bottle, as in Example 3.21. 
Thus, the connected sum of two projective planes is topologically equivalent 
to a Klein bottle. 


In Figure 3.35 we show, through cutting and pasting, that removing the 
interior of a disk from a projective plane results in a Mobius band. Therefore, 
taking the connected sum of two projective planes is the same as gluing two 
Mobius bands together along their edges. The result is topologically equivalent 
to a Klein bottle. 



FIGURE 3.35: Removing the interior of a disk from a projective plane results in a 
Mobius band. 


EXAMPLE 3.26. Consider the quotient space X obtained by identifying op¬ 
posite faces of a cube / x / x /, as shown on the far left in Figure 3.36. We 
cannot build the corresponding object in 3-space, but the result of the gluing 
is a well-defined topological space. We can recognize the resulting topolog¬ 
ical space as follows: Glue the front and back identified faces together as in 
Figure 3.36. We obtain S 1 x / x I, a solid torus, with the top and bottom 
annuli and the inside and outside annuli identified as shown. In S 1 x / x /, at 
each point on the circle S 1 , we have a square / x 7. In the quotient space X, 
the identifications on the top and bottom annuli and on the inside and outside 
annuli correspond to identifying the top and bottom edges and the side two 
edges, respectively, of each of these squares, as shown. The result of the iden¬ 
tifications on each square is a torus. Therefore, we can view X as a product 
of a circle S 1 and a torus T. Since the torus is S 1 xS 1 , it follows that X is 
topologically equivalent to S 1 x S 1 x S 1 , the 3-torus. 



FIGURE 3.36: Identifying opposite faces of the cube as shown results in the 3-torus. 
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EXAMPLE 3.27. Consider the quotient space obtained by identifying oppo¬ 
site faces of a cube, as shown in Figure 3.37. If we take a line segment running 
from the center of the left face of the cube to the center of the right face, then 
when we glue the space together, the segment becomes a circle. If we take a 
cylinder centered on that segment, then when we glue the space together, the 
circles at the ends of the cylinder are glued together with a flip, resulting in 
a Klein bottle. Therefore, within the space there is a collection of concentric 
Klein bottles that shrinks down to a core circle. 



FIGURE 3.37: Gluing the faces of a cube to obtain S l x P. 

In Exercise 3.38 we ask you to show that this quotient space is topologically 
equivalent to S 1 x P, the product of a circle and a projective plane. 

_i 


The spaces in Examples 3.26 and 3.27 are examples of 3-manifolds. We 
investigate 3-manifolds further in Section 14.3, and, in Sections 14.4 and 14.5, 
we discuss which 3-manifolds could possibly correspond to the overall shape 
of the universe. 

Exercises for Section 3.4 

3.35. On a sketch of the surface T # T, illustrate where the glued edges of the octagon 
in Figure 3.33 appear. 

3.36. (a) Show that a hexagon with opposite edges glued together straight across 

yields a toms. 

(b) Show that a hexagon with opposite edges glued together with a flip yields 
a projective plane. 

3.37. Give a representation of T#P, the connected sum of a toms and a projective 
plane, as a hexagon with pairs of edges glued together. 

3.38. Show that the quotient space in Example 3.27 is topologically equivalent to 
S 1 x P, the product of a circle and a projective plane. 

3.39. Show how to define a quotient space on a cube that results in S 1 x K, the 
product of a circle and a Klein bottle. Verify that the gluing of your cube yields 
S l x K. 
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3.5 Configuration Spaces and Phase Spaces 

The connections between topology and physics ran deep. It would take many 
volumes to cover all of the applications of topology to physics and it would 
require an advanced knowledge of topology to understand the most sophisti¬ 
cated applications. In many ways, there has been a strong interplay between 
the two fields—many topological concepts have been motivated by the study of 
physics applications, and many physics applications have been best understood 
through topological concepts. 

The most basic topological construction in physics is that of the config¬ 
uration space. In studying a system, we often need to keep track of a set 
of variables associated with the position and arrangement of a collection of 
objects of interest. For instance, we might want to keep track of the positions 
and orientations of the various parts of an industrial robot arm. The configu¬ 
ration space is a topological space that enables us to accomplish this variable 
tracking. Along with position, we may also want to consider the velocity or 
momentum of the objects in a system. In that case we use a space that includes 
both position and momentum variables; such a space is referred to as a phase 
space for the system. 

In this section, we look at a few examples of configuration spaces and phase 
spaces, and we show how some of the topological spaces and constructions that 
we have already introduced arise naturally in applications. 


EXAMPLE 3.28. To begin, consider the simple system where we have a rod 
with one end pinned at a point in the plane, around which the rod is free to 
rotate. (See Figure 3.38.) To each position of the rod, we associate a point on 
a circle centered at the pivot point, and to each such point on the circle, we 
associate a corresponding unique position of the rod. Thus the positions of the 
rod in the plane are modeled by the configuration space S 1 . 



FIGURE 3.38: The configuration space of a rotating rod with one end pinned in the 
plane is a circle. 


EXAMPLE 3.29. Now consider the planar two-rod system pictured in Fig¬ 
ure 3.39. Rod A has one free end and one end that is pinned in the plane at a 
point around which the rod can rotate. Rod B is linked to the free end of rod A 
at a joint that allows rotation. Since the pinned end of rod A and the linked end 
of rod B each allow rotation through a full circle of positions, the configuration 
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FIGURE 3.39: The configuration space of the two-rod system is the torus. 

space of the system is S 1 x S 1 , which is the torus. For each point (9 a, 6b) on 
the torus, there is a configuration of our system, and each configuration of our 
system corresponds to a unique point on the torus. 


It is important to avoid confusing the configuration space in Example 3.29 
with the space in the plane that is swept out by the free end of the system. 
(See Figure 3.40.) Imagine putting a pen at the end of rod B and observing 
what space the pen traces as we run through all possible configurations of our 
system. If rod B and rod A are not equal in length, then the pen traces out an 
annulus in the plane. When the length of rod A equals the length of rod B, the 
pen traces out a disk. This “traced out” space is referred to as the operational 
space of the system. We explore the relationship between the configuration 
space and the operational space further in Section 4.3. 



FIGURE 3.40: The operational space of the two-rod system is either an annulus or 
a disk. 

The system in Example 3.29 is known as a linkage. A linkage is a collection 
of rigid parts that are connected together at joints where the parts are free to 
move relative to each other. In the eighteenth and nineteenth centuries, linkages 
were of mathematical interest because of their geometric properties. In the 
latter half of the nineteenth century, with the dawn of the industrial revolution, 
linkages became of applied interest as well, providing important components in 
machine design. Linkages are still employed in many mechanical systems. For 
example, a linkage converts the rotary motion of a motor into the back-and-forth 
motion of a windshield wiper. Linkages also continue to be of mathematical 
interest, generating problems explored in a variety of areas of mathematics. 
(For example, see [Con] and [Thu2].) 
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As manufacturing processes have become more complex and automated, so 
have the mechanical components involved. Robot arms are often employed to 
perform repetitive tasks that are done more cheaply, more accurately, and more 
precisely by a robot than by a human. In the next example and in Exercise 3.40, 
we examine configuration spaces associated with robot arms. 


EXAMPLE 3.30. In Figure 3.41 we depict a robot arm that can vary along 
two angles and one length. 



FIGURE 3.41: A robot arm 

The base of the robot arm can turn a full 360 degrees. The correspond¬ 
ing configuration-space coordinate is a point on the circle S 1 . The arm can 
turn from 0 to 90 degrees and therefore the corresponding configuration-space 
coordinate is constrained to lie in the interval [0, 90]. The arm can extend 
from 0 to 25 centimeters, so its configuration-space coordinate is constrained 
to lie in the interval [0,25]. Thus, the configuration space of this robot arm 
is S 1 x [0, 90] x [0, 25], as illustrated in Figure 3.42. This is topologically 
equivalent to 5 1 x / x 7, a solid torus. 



FIGURE 3.42: The configuration space for the robot arm is a solid torus. 


EXAMPLE 3.31 . Here we look at a simple application to sheet-metal folding. 
Suppose we have a square of sheet metal that we can fold along the horizontal 
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FIGURE 3.43: The square can be folded along the horizontal axis and the vertical 
axis. 

axis and the vertical axis as shown in Figure 3.43. Let Oh be the fold angle 
of the top half of the square with respect to the bottom half, where Oh = 0 
represents a fold all of the way over to the front, Oh = 2n represents a fold all 
of the way to the back, and Oh = n corresponds to the unfolded situation as in 
the figure. Similarly, we let Oy be the fold angle of the left half of the square 
with respect to the right half. 

We make the physically reasonable assumption that we can fold along the 
vertical axis only when the square is folded along the horizontal axis with 
Oh = 0, n, or 2tc. We make the corresponding assumption for folding along 
the horizontal axis. The configuration space of possible pairs of folding an¬ 
gles ( Oh , Oy) is the subspace of the OnOy- plane, made up of three horizontal 
segments and three vertical segments, as illustrated in Figure 3.44. 

2tt -- 

Oy 7 r - 

o L-- 

0 TT 2v 

Oh 

FIGURE 3.44: The configuration space of possible pairs of fold angles ( 0n,0y ). 

The bottom segment in the figure corresponds to varying the fold angle 
along the horizontal axis while the left half of the square is folded all of the 
way over in front of the right half. The other segments in the figure can be 
interpreted in like fashion. 


EXAMPLE 3.32. In this example, we consider a bead in motion on an infinite 

P 



FIGURE 3.45: A bead in motion on an infinite wire. 
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wire, as illustrated in Figure 3.45. In this case, not only are we interested 
in the bead’s position, but we also wish to track its momentum. Thus, two 
variables are needed, x e R for position, and p e R for momentum. Here we 
have a phase space rather than a configuration space, and the phase space is 
{(x, p) | x, p e M}. This, of course, is just the product space 8x8 = R 2 . 


EXAMPLE 3.33. Suppose an object of mass m is falling or rising vertically, 
near the surface of the earth, under the influence of gravity. Let x be the height 
above the ground and p be the momentum of the object. Then the phase space 
is given by {(x, p) | x, p e R, x > 0} = R+ x R. For motion within this 
system, assume that the total energy is conserved. The total energy is expressed 

as the sum of the potential energy, mgx (where g represents the acceleration 

2 

due to gravity) and the kinetic energy, Then, when we examine phase 
space tracks of the changing position and momentum of our object, we find 
that the tracks must remain restricted to a subspace of R + x R of the form 

Ec = {(*, p) € R + x R | mgx + ^ = C), 

for C > 0. (See Figure 3.46.) The sets Ec partition the phase space into 
parabolic curves of constant energy along which all phase space tracks of the 
system’s motion must lie. 



FIGURE 3.46: Constant energy curves in the phase space of a falling or rising 
object. 


j 


Often, in the study of a given system, conserved quantities (such as total 
energy in the previous example) define subspaces of the configuration space 
or phase space that aid in the investigation of the system. 


EXAMPLE 3.34. Suppose we have a molecule, M, that consists of two dis¬ 
tinct atoms, and we wish to record its position and orientation in 3-space. (See 
Figure 3.47.) We associate to the molecule the point in R 3 at its center of mass. 
This enables us to locate the molecule in 3-space, but we need more informa¬ 
tion to determine the orientation of the molecule once its center of mass has 
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FIGURE 3.47: An oriented molecule in 3-space. 

been situated. This can be accomplished by associating a point on the 2-sphere 
to each possible orientation, as follows: 

We pick one of the two atoms, and to each orientation of the molecule, we 
associate a vector pointing from the center of mass to the center of the chosen 
atom. To that vector (and therefore that orientation) we associate the point on 
the sphere to which the vector points when placed with its tail at the center 
of the sphere. In this way, each orientation corresponds to a different point 
on the sphere. Hence, the configuration space for recording the position and 
orientation of the molecule in 3-space is IR 3 x S 2 . 


EXAMPLE 3.35. Now suppose that the molecule in Example 3.34 is made 
of two identical atoms. Again, every point in R 3 corresponds to a possible 
location of the molecule’s center of mass. Also, at each location in R 3 , points 
on the sphere correspond to each possible orientation of the molecule. How¬ 
ever, because of the symmetry of the molecule, each pair of antipodal points in 
the sphere corresponds to indistinguishable configurations. Thus we can “re¬ 
duce” the configuration space by identifying antipodal points on each sphere 
of molecule orientations. Therefore the configuration space is R 3 x P, where 
P is the projective plane. 


As in Example 3.35, it is common in physics and chemistry to obtain a 
configuration space or a phase space as a quotient space of another space, 
taking advantage of symmetry properties of the system. 

Exercises for Section 3.5 

3.40. Determine the configuration space for each of the following three robot arms, 

which are illustrated in Figure 3.48: 

(a) The robot arm has two components that can rotate through a full circle. The 
whole robot arm can be translated along a base track that is 50 centimeters 
long. The height of the robot arm can be raised 20 centimeters from its 
lowest position. 

(b) The robot arm has four moving components that each can rotate through 
a full circle. 

(c) The robot arm has two moving components that each can rotate through a 
full circle. The arm of the robot can rotate 90 degrees from a horizontal 
position to a vertical position. 
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(a) (b) (c) 


FIGURE 3.48: A variety of robot arms. 

3.41. Consider the system shown in Figure 3.49. It consists of a rod with one end free 
and one end attached to a track that is one meter long. The point of attachment 
can slide along the track, and the rod can rotate through a full circle at the point 
of attachment. Determine the configuration space of the system. 



3.42. Suppose we have a disk-shaped piece of sheet metal that can be folded along 
three axes as shown in Figure 3.50. Describe and illustrate the configuration 
space of possible fold angles for the disk. 



FIGURE 3.50: The disk can be folded along each of the axes shown. 


3.43. Describe the phase space (position and momentum) for the motion of a bead 
on a circular wire. 

3.44. Determine the configuration space for the position and orientation of a water 
molecule, H 2 O, in the plane. 










Chapter 4 


Continuous Functions and 
Homeomorphisms 

Now that we have defined various concepts related to topological spaces and 
their subsets, we would like to consider functions sending one topological 
space to another. The concept of continuity of functions is one of the most 
important in topology. 

A topology on a set is a structure that establishes a notion of proximity on 
the set. Continuous functions between topological spaces preserve proximity, 
reflecting the idea that a continuous function sends points that are close in 
one space to points that are close in the other. In Section 4.1, we give a 
topological definition of continuity and we present results and examples related 
to continuity. 

A continuous bijective function that has a continuous inverse is called a 
homeomorphism. Such functions provide us with the main notion of topolog¬ 
ical equivalence. In Section 4.2, we introduce and explore homeomorphisms 
and topological equivalence. 

We close this chapter in Section 4.3 with a discussion of the forward kine¬ 
matics map, a continuous function that plays an essential role in the study of 
mechanical systems. 

4.1 Continuity 

Our first formal exposure to continuity usually comes in a calculus or analysis 
course, focusing on functions mapping the real line M to itself. Typically, it is 
defined as follows: 

DEFINITION 4.1. A function /:!->!« continuous if for every x$ 6 K. 
and every e > 0, there exists a 8 > 0 such that if \x — xo\ < 8, then 

I/O*) - /(*o)l < £• 

In other words, / is continuous if for every x 0 and e, we can find a 8 so 
that if x is close enough to *o (within 5), then f(x) is close to f(x o) (within 
e). This is often called the e — 5 definition of continuity. It is an extremely 
useful concept that plays a pivotal role in analysis. 

We provide a general definition of continuity for functions that map from 
one topological space to another. This topological definition of continuity is 
very simple to state and, as we will show, is equivalent to the e — 8 definition 
for functions that map M to M. 

DEFINITION 4.2. Let X and Y be topological spaces. A function f : X -> Y 
is continuous if /“’ (V) is open in X for every open set V in Y. 

We call this the open set definition of continuity. Paraphrased, it states 
that / is continuous if the preimage of every open set is open. 
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EXAMPLE 4.1. Let X = [a, b, c, d) and Y = {1, 2, 3} have the topologies 
as depicted in Figure 4.1. Let functions f, g,h : X -*■ Y be defined by 

f{a) = 1, f(b) = 1, f(c) = 2, f(d ) = 2, 

g(a) = 2, g(b ) = 2, g(c ) = 1, g(<0 = 3, 

ft (a) = 1, ft(ft) = 2, ft(c) = 2, ft(d) = 3. 

The function / is continuous, as can be easily verified by checking that 
the preimage of each open set in Y is open in X. Similarly, g is continuous. 
However, ft is not continuous because {2} is open in Y, but ft -1 ({2}) = [b, c} 
is not open in X. 



abed 


1 2 3 


FIGURE 4.1: Functions /, g, and h map X to Y. 


EXAMPLE 4.2. Let Zand Y be topological spaces. 

(i) The identity function id : X -» X, given by id(x) = x, is contin¬ 
uous. 

(ii) Pick yo e Y, and consider the constant function, C : X Y, 
defined by C(x) = yo for every x e X. We show that C is 
continuous. Suppose that V is open in Y; then C -1 (V) = X if 
yo e V, and C“*(Y) = 0 if yo ^ V. In either case C _1 (V) is 
open in X, and therefore C is continuous. 


It is not just the formula or the relation defining a function that determines 
whether or not the function is continuous; the topology on the domain and the 
topology on the range are also significant. Consider the following example. 


EXAMPLE 4.3. Let / : R -► R/ c be given by f(x) = , where 

the domain IR has the standard topology and the range R/ c is the real line 
in the finite complement topology. The function / is continuous since if 
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V = E — {*1 ,..., x m ] is an open set in the finite complement topology, then 
f~HV) = K — {2xi + 1,... , 2x m + 1}, and this is an open set in K with the 
standard topology. 

Now let g : M —► R/ be given by g(x) = where the domain R has the 
standard topology and the range M/ is the real line in the lower limit topology. 
The function g is expressed by the same formula as /, but g is not continuous. 
The set [0, 1) is open in M/, but g _1 ([0, 1)) = [1,3) is not open in M with the 
standard topology. 


EXAMPLE 4.4. Let X be a topological space and Y be a subspace of X. The 
inclusion function i : Y -»• X, given by i(y) = y, is continuous. Why? If 
we take an open set U in X, then = U flf, and U n Y is an open 

set in Y by the definition of the subspace topology on Y. In fact, the subspace 
topology on Y is the coarsest topology on Y for which the inclusion function 
i is continuous. (See Exercise 4.5(a).) 

Let p : X A be a surjective function from a topological space X to. a set 
A, and let A have the quotient topology induced by p. Then p is continuous. 
Why? By the definition of the quotient topology, a set V is open in A if and 
only if p~ 1 (V) is open in X. Therefore, the preimage of every open set in A 
is automatically open in X. In fact, the quotient topology on A is the finest 
topology on A for which p is continuous. (See Exercise 4.5(b).) 


By definition, a function / : X -*■ Y is continuous if the preimage of 
every open set in Y is open in X. However, checking that every open set in Y 
has an open preimage in X is more than we really need to do. As the following 
theorem indicates, to prove that / is continuous, it suffices to consider only 
the sets in a fixed basis for Y, showing that the preimage of each basis element 
is open in X. 

THEOREM 4.3. Let X and Y be topological spaces and B be a basis for the 
topology on Y. Then f : X —> Y is continuous if and only if f~ l (B) is open 
in X for every B € B. 

Proof. Suppose / : X -» Y is continuous. Then / -1 (V) is open in 
X for every V open in Y. Since every basis element B is open in Y, it 
follows that f~ x {B) is open in X for all B e B. 

Now, suppose f~ x (B) is open in X for every B e B. We show that 
/ is continuous. Let V be an open set in Y. Then V is a union of basis 
elements, say V = U B a . Thus, 

r\v) = r\vB a ) = ur\B a ). 

By assumption, each set f~ l (B a ) is open in X\ therefore so is their 
union. Thus, / _1 (V) is open in X, and it follows that the preimage of 
every open set in Y is open in X. Hence, / is continuous. ■ 
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EXAMPLE 4.5. The functions f(x ) = x + 2, g(x) = 2x, and h(x) = x 2 are 
all continuous functions mapping R to R in the standard topology. 

Why? Let (a, b), with a < b, be a basis element for the standard topology. 
Then 

f~H(a,b)) = (a — 2,b — 2), 

g~\(a,b)) = 

(—Vb, —■s/a) U {s/a, Vb) if a > 0, 
h~ l ((a,b )) = {—s/b,y/b) ifa<0and&>0, 

0 if b < 0. 

In each case, the preimage of an arbitrary basis element is an open set. Thus, 
each function is continuous. 


EXAMPLE 4.6. Multiplication is a continuous function. That is, the function 
/ : R 2 -> R, given by / (jc, y) = xy, is continuous. To see this, take a basis 
element (a, b ) in the range K. Its preimage in R 2 is the set 

b )) = {(*, y) € R 2 | a < xy < b). 

This is a set sandwiched between two hyperbolas. An example, with a and b 
both positive, is illustrated in Figure 4.2. 



FIGURE 4.2: The preimage f~ l ((a,b)) = {(jc, y) e R 2 |a < xy < b] with a and b 
both positive. 

It is intuitively clear that f~ ] ((a,b)) is an open subset of R 2 . We can 
prove this explicitly by showing that for each point (p, q) € b)), 

there is an open square centered at ( p, q) and contained in f~ l ((a, b)). (See 
Figure 4.2.) In fact, if we set m = min{fr — pq, pq — a} and choose S > 0 
so that &\p\, 8\q\, and 3<$ 2 are all less than m/3, then it can be shown that 
(p — 8, p + 8) x (q — 8, q + 8) C / _1 ((a, b)). (See Exercise 4.15.) 
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The addition function on the real numbers is also continuous. (See Exer¬ 
cise 4.14.) Combining the continuity of multiplication and the continuity of 
addition, we can obtain the following theorem: 

THEOREM 4.4. Let M have the standard topology. Then every polynomial 
function p : R —» R, with p(x) = a„x n + ... + a\x + ao, is continuous. 

Proof. See Exercises 4.13-4.17. ■ 

The open set definition of continuity, and results based on it, can be used 
to prove that rational functions, trigonometric functions and their inverses, 
exponential functions, and logarithmic functions are all continuous on domains 
where they are defined. We do not prove these results here, but throughout the 
rest of the text we assume them. 

The topological definition of continuity is very simple to state, but it might 
leave you wondering how the definition reflects the idea that a continuous 
function preserves proximity. The following theorem provides some insight, 
indicating that a continuous function maps a point in the closure of a set to a 
point in the closure of the image of the set. 

THEOREM 4.5. Let f : X -» Y be continuous and assume that A C X. If 
x € Cl(A), then f(x) e Cl( f(A)). (See Figure 4.3.) 









f 





X 





Y 


FIGURE 4.3: If x € CI(A), then /(*) € CI(/(A» 

Proof. Suppose that / : X -> Y is continuous, x e X, and A C X. 
We prove that if f(x) $ C1(/(A)), then x g C1(A). Thus suppose that 
f(x) <£ Cl (/(A)). By Theorem 2.5 there exists an open set U containing 
f(x), but not intersecting /(A). It follows that f~ 1 (U) is an open set 
containing x that does not intersect A. Thus x & C1(A), and the result 
follows. ■ 

Next we consider the equivalence between the open set definition of con¬ 
tinuity and the e — 8 definition of continuity in the case of functions that map 
]R to 1R. A translation of the e — 5 definition into topological terms might read 
as follows: 

Translation of the e — <5 Definition: Let X and Y be topological spaces. A 
function / : X -> Y is continuous if, for every x e X and every open set 
U containing /(jc), there exists a neighborhood V of x such that f(V) c U. 
(See Figure 4.4.) 
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FIGURE 4.4: For every U containing f(x), there exists V containing .r such that 
fiV ) c U. 

In the translation, the open set U is playing the role of the e-interval in the 
e — 8 definition, and the open set V is playing the role of the 5-interval. It is 
straightforward to show that for functions / : R -»• E, the e — 8 definition 
of continuity is equivalent to the translation. (See Exercise 4.3.) In the next 
theorem, we show that the translation is equivalent to the open set definition. 

THEOREM 4.6. A function f : X —>• Y is continuous in the open set def¬ 
inition of continuity if and only if, for every x G X and every open set U 
containing fix), there exists a neighborhood V of x such that f(V) C U. 

Proof. First, suppose that the open set definition holds for functions 
f : X -*■ Y. Let x G X and an open set U C Y containing fix) be 
given. Set V = f~ l (U). It follows that x € V and that V is open in X 
since / is continuous by the open set definition. Clearly /(V) c U, and 
therefore we have shown the desired result. 

Now assume that for every x e X and every open set U containing 
fix), there exists a neighborhood V of x such that f(V ) c U. We show 
that / _I (W) is open in X for every open set W in Y. Thus let W be an 
arbitrary open set in Y. To show that f~ l (W) is open in X, choose an 
arbitrary x g f~ l (W). It follows that fix) g W, and therefore there 
exists a neighborhood V x of x in X such that / {V x ) c W, or, equivalently, 
such that V x c f~ ] (W). Thus, for an arbitrary x € f~ l (W) there exists 
an open set V x such that x e V x c f~ l (W). Theorem 1.4 implies that 
is open in X. ■ 

Another sense in which continuity preserves proximity is demonstrated 
by the fact that continuous functions map convergent sequences to convergent 
sequences, as the following theorem indicates: 

THEOREM 4.7. Assume that f : X —>• Y is continuous. If a sequence 
(jci , xi,...) inX converges to a point x, then the sequence (f(x\), f(xf ),...) 
in Y converges to fix). 

Proof. Let U be an arbitrary neighborhood of f(x) in Y. Since / is 
continuous, f~ x {U) is open in X. Furthermore, fix) G U implies that 
x G f~' [ iU). The sequence (x\ , xi,...) converges to x\ thus, there 
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exists N € Z+ such that x„ € / _1 (t/) for all n > N. It follows that 
f(x n ) € U for all n > N, and therefore the sequence (/(xi), /(X 2 ),...) 
converges to /(x). ■ 


IMPORTANT NOTE: A continuous Junction does not necessarily map open 
sets to open sets. For example, the function f : R —»■ R, given by /(x) = x 2 , 
is continuous, but the image of the open set {— 1, 1) is [0, 1), which is not open. 

The following theorem indicates that the open set definition of continuity 
has an equivalent closed set version. 

THEOREM 4.8. Let X and Y be topological spaces. A function f : X -* Y 
is continuous if and only if f~ l (C) is closed in X for every closed set C GY. 

Proof. See Exercise 4.2. ■ 

It is straightforward to show that compositions of continuous functions are 
continuous. Specifically, we have the following theorem: 

THEOREM 4.9. Let f : X -» Y and g : Y Z be continuous. Then the 
composition function, g o / : X Z, is continuous. 

Proof. Suppose that f : X -> Y and g : Y -> Z are continuous, 
and let U be an open set in Z. Then (g o f)~ l (U) = f~ i (g~ l (U)). 
Since g is continuous, g -1 (U) is open in Y, and since / is continuous, 
f~ l (g~ l (U)) is open in X. Thus, (g o f)~ l (U ) is open in X for an 
arbitrary U open in Z, implying that g o / is continuous. ■ 


We will find the following lemma useful. It is called the Pasting Lemma 
because it provides conditions under which we can paste together continuous 
functions / : A Y and g : B -*■ Y to obtain a continuous function, 
h : A U B —► Y, defined on the union of the domains of / and g. (See 
Figure 4.5.) 



FIGURE 4.5: The functions f : A -*■ Y and g : B -> Y paste together to define 
A:AUfi->F. 
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LEMMA 4.10. The Pasting Lemma. Let X be a topological space and let 
A and B be closed subsets ofX such that A U B = X. Assume that f : A Y 
and g : B -> Y are continuous and f (jc) = g{x) for all x in A H B. Then 
h : X —y Y, defined by 


h{x) = 


| f{x) ifx € A, 
\ g(x) ifx € B, 


is a continuous function. 

Proof. By Theorem 4.8, it suffices to show that if C is closed in Y, 
then h~ l (C) is closed in X. Thus suppose that C is closed in Y. Note 
that h~ l (C) = f-\C) U g -1 (C). Since / is continuous, it follows by 
Theorem 4.8 that f~ l (C) is closed in A. Theorem 3.4 then implies that 
/“* (C) = D n A where D is closed in X. Now, D and A are both closed 
in X, and f~ l (C) = D HA; therefore, /“’ (C) is closed in X. Similarly, 
g _l (C) is closed in X. Thus, h~ l (C) is the union of two closed sets in 
X and therefore is closed in X as well. It follows that h is continuous. 


EXAMPLE 4.7. The absolute value function h : R ->■ E, given by 


h{x) = |*! = 


x if x > 0, 
—x if x < 0, 


consists of two continuous functions defined on closed subsets of E. One is the 
function / : (—oo, 0] ->• K, given by f{x) = —jc, and the other is the function 
g : [0, oo) -» E, given by g(jc) = jc. These two functions are pieced together 
at jc = 0, where they agree. Hence, the resulting function h is continuous. 


Exercises for Section 4.1 

4.1. (a) Let X have the discrete topology and Y be an arbitrary topological space. 

Show that every function / : X —> Y is continuous. 

(b) Let Y have the trivial topology and X be an arbitrary topological space. 
Show that every function / : X -* Y is continuous. 

4.2. Prove Theorem 4.8: Let X and Y be topological spaces. A function /: X -> Y 
is continuous if and only if f~ l (C) is closed in X for every closed set C c T. 

4.3. Show that a function / : R -> R is continuous in the e — S definition of 
continuity if and only if, for every jc € R and every open set U containing 
/(jc), there exists a neighborhood V of jc such that f(V) c U. 

4.4. Prove whether or not each given function is continuous. 

(a) The function / : M; -»• R, defined by /(jc) = 3jc — 5, where R; is the 
real line with the lower limit topology. 

(b) The function g : R/ c —»■ M, defined by g(x) = 3x — 5, where R/ c is the 
real line with the finite complement topology. 
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4.5. (a) Let X be a topological space, and let Y be a subset of X. Show that 

the subspace topology on Y is the coarsest topology on Y for which the 
inclusion function, i : Y -» X, defined by i(y) = y, is continuous. That 
is, show that if T is the subspace topology on Y, and T' is a topology on 
Y such that i is continuous, then T c T. 

(b) Let p : X —> A be a suijective function from a topological space X to 
a set A. Show that the quotient topology on A induced by p is the finest 
topology on A for which the function p is continuous. That is, show that 
if T is the quotient topology on A, and T' is a topology on A such that p 
is continuous, then T' C T. 

4.6. (a) Let X and Y be topological spaces, and let X x Y be the corresponding 

product space. Define the projection functions 

px '■ X x Y -> X and py : X x Y -*■ Y 

by Px(x, y) = x and py(x, y ) = y. Prove that px and py are continuous, 
(b) Show that the product topology on X x Y is the coarsest topology on 
X x Y for which both functions px and py are continuous. That is, show 
that if T is the product topology on X x Y , and T is a topology onXxf 
such that px and py are continuous, then T c T'. 

4.7. Suppose X is a space with topologies T\ and 72. Let id : X -+ X be the 
identity, id(x) = x, and assume that the domain X has topology 7j and that 
the range X has topology Show that id is continuous if and only if T\ is 
finer than 72. 

4.8. Let f : X -*■ Y be a continuous function. If x is a limit point of a subset A 
of X, is it true that f(x) is a limit point of /(A) in T? Prove this or find a 
counterexample. 

4.9. Let f, g : X -*■ Y be continuous functions. Assume that Y is Hausdorff and 
that there exists a dense subset D of X such that f{x) = g(x) for all x e D. 
Prove that f(x) = g(x) for all x e X. 

4.10. Let / : X —> Y be a function. The graph of / is the subset of X x Y given by 
G = {(jc, /(*)) | x e X}. Show that, if / is continuous and Y is Hausdorff, 
then G is closed in X x Y. (Note: In Exercise 7.13 we consider a converse of 
this result, assuming Y also satisfies a property known as compactness.) 

4.11. Let / : X -> Y be continuous, and let A be a subspace of X. Prove that 
f\A : A —> Y, the restriction of / to A, is continuous. 

4.12. (a) Prove that the Pasting Lemma holds if we replace the assumption that A 

and B are both closed in X with the assumption that A and B are both 
open in X. 

(b) Provide an example showing that the Pasting Lemma does not hold if we 
drop the assumption that A and B are both closed in X. 

4.13. (a) Let f\ : X -*■ Y\ and /j : X -* Y% be continuous functions. Show that 

h : X -> Yj x I 2 , defined by h(x) = (/i(x), / 2 U)), is continuous as 
well. 

(b) Extend the result of (a) to n functions, for n > 2. 

4.14. Show that the addition function, / : R 2 -*■ R, given by f(x, y) = x + y, is a 
continuous function. 
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4.15. Let / be the multiplication function, /( x , y) = xy. Complete the proof of the 
continuity of / that was outlined in Example 4.6, by doing the following: 

(a) Show that if p and q are both positive, and 8 is as described in the example, 
then (p - 8, p + 8) x (q - 3, q + 8) C f~ x ((a,b)). 

(b) Consider the rest of the possibilities for p and q being positive or negative, 
and show that (p — 8, p + 8) x (q — 8,q + 8) C / -1 ((a, b)). (Note: 
You can argue by symmetry here. For example, the situation for p and q 
both negative is symmetric about the origin to the situation for p and q 
both positive.) 

4.16. Use Example 4.6, Exercises 4.13 and 4.14, and Theorem 4.9 to show that the 
sum and product of a finite number of continuous functions are also continuous 
functions. That is, assuming that f \,..., f m : R -* R are continuous, prove 
that S : R —► R and P : R -> R, defined by S(x) = f\(x) + ... + f m (x) and 
P(x) = f l (x)f 2 (x)...f m (x), are continuous. 

4.17. Use Exercise 4.16 to show that every polynomial function p : R -»• R, given 
by p(x) = a n x n + ... + a\x + ao, is continuous. 


Supplementary Exercises: Sequences of Functions 

Sequences of functions play an important role in topology and analysis. For example, 
functions that are defined by power series, such as sin(x) and e x , can be shown to 
be continuous because they are limits of converging sequences of polynomials, which 
themselves are continuous functions. In this text, we employ a convergent sequence 
of functions in the proof of the Tietze Extension Theorem, which appears in a set of 
supplementary exercises in Chapter 7. 

In these exercises, we consider conditions that ensure that a sequence of continuous 
functions converges to a continuous function. To begin, consider the sequence of 
functions ( s n ) where, for each n = 1 , 2,3, ..., the function s n : [0,1] -* [0,1] is 
defined by 





ifO < x < i, 
if £ < x < 1. 


SE4.18. Sketch graphs of si, s 2 , and s 2 , and prove that s n is continuous for all 
n = 1,2,3. 

DEFINITION 4.11 . Let X and Y be topological spaces. A sequence (/„) of functions 
f n :X->-Y is said to converge (pointwise ) to a function f : X -+ Y if for each 
x e X, the sequence (/„(x)) converges in Y to f(x). 

SE 4.19. The sequence ( s n ) converges to a function s : [0,1] -> [0,1]. Determine s, 
and prove that s is not continuous. 

Thus, a sequence of continuous functions does not necessarily converge to a continuous 
function. However, we are guaranteed that the limit function is continuous if the 
individual sequences (/„(x)) converge at a uniform rate. This is made explicit for the 
special case where the range is R, in the following definition: 
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DEFINITION 4.12. A sequence of functions f„ : X —*■ K is said to converge 
uniformly to f : X ->■ R if for every e > 0 there exists N € Z + such that 
| f„(x) — f(x)\ < e for every x € X andn > N. 

We can visualize the idea behind uniform convergence by considering the situation 
graphically. If (/„) converges uniformly to /, then for every band of vertical thickness 
2e, centered on the graph of /, there exists N e Z+ such that for every n > N, the 
graph of f n lies within the band. (See Figure 4.6.) 



FIGURE 4.6: The graph of f n lies within e of the graph of /. 

SE 4.20. Prove that the previously defined sequence of functions (s n ) does not converge 
uniformly. 

THEOREM 4.13. The Uniform Convergence Theorem. If if „) is a sequence 
of continuous functions /„ : X —> R that converges uniformly to f : X -> R, then f 
is continuous. 

Proof. Let U c R be an open set. We prove that for each x e f~ l (U) there 
exists an open set V x c X such that x e V x c / -1 (U). Let e > 0 be such that 
(f(x*) — e, fix*) + e) C U. By uniform convergence, we can pick /V e Z + 
such that \ fn(x) — fix) \ < | for every n > N and x e X. Pick n' > N. Let 
u' = (fn'(x) - f, fn'ix) + |), and let V x = f~ l (U'). We claim that V x is 
open in X , contains x, and satisfies f(V x ) c U. (See Exercise SE 4.21.) Given 
the claim, it follows that for every x e U, there exists an open set V x in X such 
that re^c f~ l (U). Therefore U is open in X, and / is continuous. ■ 


SE 4.21. In the proof of Theorem 4.13 verify that V x is open in X, contains x, and 
satisfies f(V x ) c U, as claimed. 


4.2 Homeomorphisms 

In this section we define homeomorphism, a mapping between topological 
spaces that provides the most fundamental notion of topological equivalence. 
Homeomorphisms preserve all the properties given by a topology, and thereby 
define a correspondence between points and between open sets in two topo¬ 
logical spaces. 
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DEFINITION 4.14. Let X and Y be topological spaces, and let f : X Y 
be a bijection with inverse f~ l : Y —> X. If both f and f~ l are continuous 
functions, then f is said to be a homeomorphism. If there exists a home- 
omorphism between X and Y, we say that X and Y are homeomorphic or 
topologically equivalent, and we denote this by X = Y. 

Let f : X -*■ Y be a bijective function. For / -1 : Y ->■ X to be 
continuous, it must be true that ( f~ l )~ l (U ) is open in Y for every open set U 
inX. But since / is a bijection, (/ _1 ) _1 (I7) = f(U) for U C X. Thus/(t/) 
must be open in Y for every open set U in X. Therefore, saying that /”' is 
continuous when / is a bijection is equivalent to saying that the image of every 
open set under / is an open set. Similarly, saying that / is continuous when 
/ is a bijection is equivalent to saying that the image of every open set under 
the inverse function, f~ l , is an open set. (See Figure 4.7.) 

We can paraphrase the definition of homeomorphism by saying that / is a 
homeomorphism if it is a bijection on points and a bijection on the collections 
of open sets making up the topologies involved. Every point in X is matched 
to a unique point in Y, with no points in Y left over. At the same time, every 
open set in X is matched to a unique open set in Y, with no open sets in Y left 
over. (See Figure 4.7.) 



FIGURE 4.7: A homeomorphism sends open sets to open sets. 

If you have taken an abstract algebra course, you have likely encountered 
the concept of a group isomorphism, which is an equivalence between groups. 
A group isomorphism is a bijection on the elements of the groups that preserves 
the algebraic structure given by the multiplication operation on each group. In 
a similar way, a homeomorphism is a bijection on topological spaces that 
preserves the topological structure given by the collection of open sets in each 
topological space. 

The expression “topologically equivalent,” which previously we used in¬ 
formally, has now been made precise by Definition 4.14. 


EXAMPLE 4.8. Let X and Y be the topologies on the three-point sets shown 
in Figure 4.8. Define / : X Y by /(a) = 1, f(b) = 2, /(c) = 3. Then 
/ is a homeomorphism since it is a bijection on points and a bijection between 
the open sets in X and Y. 
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X 


f 




FIGURE 4.8: A homeomorphism from X to Y. 

_i 


EXAMPLE 4.9. The two topologies on the three-point set X = {a, b,c}, 
shown in Figure 4.9, are different as collections of subsets of X but are topologi¬ 
cally equivalent since the function / : X —> X, given by f(a) = c, fib) = b, 
f{S) = a, is a homeomorphism. 

c20O> c OQ g> 

a b c a b c 

FIGURE 4.9: Two homeomorphic topologies on X = [a, b, c). 


EXAMPLE 4.10. Define the function / : K -> R by fix) = 3x + 1. 
This function is a bijection with inverse given by f~ l ix) = (See Ex¬ 
ample 0.12.) Since / and / -1 are both polynomials on R, they are both 
continuous by Theorem 4.4. Thus, / is a homeomorphism. 

Since / is a homeomorphism, it preserves the topological properties of R. 
Stretching by a factor of three and then sliding to the right by a distance of one 
does not change the basic topological properties of the real line. 


The following basic facts about homeomorphisms together imply that topo¬ 
logical equivalence is an equivalence relation on the collection of all topological 
spaces. (See Exercise 4.28.) 

(i) The function id : X —» X, defined by idix) = x, is a homeomor¬ 
phism. 

(ii) If / : X -> Y is a homeomorphism, then so is f~ l : Y -» X. 

(iii) If / : X Y and g : Y -> Z are homeomorphisms, then so is 
gof:X^Z. 
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EXAMPLE 4.11 . Let (— 1, 1) have the standard topology, and define a func¬ 
tion / : R -* (— 1, 1) by f(x) = xqrfjj- The graph of / is shown in Fig¬ 
ure 4.10. It is apparent from the graph that / is a bijection between R and 
(— 1, 1). It is also apparent that preimages of open intervals are open intervals 
for both / and f~ x . Therefore, / and f ~ 1 are both continuous, and it follows 
that / is a homeomorphism between R and (—1,1). 



FIGURE 4.10: A homeomorphism between M and (-1,1). 
_| 


EXAMPLE 4.12. Not only is (—1, 1) homeomorphic to R, but every nonempty 
open interval (a, b) is as well. In fact, consider the following collections of in¬ 
tervals with the standard topology (assuming a and b are arbitrary real numbers 
with a < b): 

(i) Open intervals: (a, b ), (—oo, a), (a, oo), R, 

(ii) Closed, bounded intervals: [a,b], 

(Hi) Half-open intervals and closed, unbounded intervals: 


[a, b), ( a, b], (-oo, a], [a, oo). 


Within each of the collections (i), (ii), and (iii), all of the spaces are topo¬ 
logically equivalent. We illustrate example homeomorphisms in Figure 4.11 
and discuss these equivalences further here. 

The function / : R —> (a, oo), given by f(x) = e x + a, is a homeomor¬ 
phism. Thus, R is homeomorphic to every interval (a, oo). Since topological 




FIGURE 4.11: The homeomorphisms /, g, and h. 
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equivalence is an equivalence relation, it also follows that every interval (a, oo) 
is homeomorphic to every other interval in the form (a ', oo). 

The linear function g : [0, 1] -> [ a , £>], given by g(x) = (b — a)x + a, 
is a homeomorphism between [0,1] and [a, b], Therefore, every interval [a, b] 
is homeomorphic to [0,1], and, consequently, every interval [a, b] is homeo¬ 
morphic to every other closed, bounded interval [ a', b'] with a' < b'. 

Furthermore, the function h : [a, oo) —> (—oo, a'], given by h(x) = 
—x + a' + a, is a homeomorphism between the intervals [a, oo) and (—oo, a']. 
Thus, if 1\ and I 2 are intervals of either form, [a, 00 ) or (— 00 , a'], then I\ and 
I 2 are homeomorphic. 

In Exercises 4.25 and 4.26 you are asked to define homeomorphisms that 
establish the remainder of the topological equivalences in the claim that all of 
the interval subspaces in each collection are topologically equivalent. 

From one collection to another, the interval subspaces of R are not home¬ 
omorphic. Although that might seem intuitively clear, we need to develop 
further topological concepts before we can explicitly prove this. We do so in 
Chapter 6. (See Exercise 6.27.) 


EXAMPLE 4.1 3. Let [0,2 n) and S 1 have the standard topology as subspaces 
of R and R 2 , respectively. We denote each point in S' 1 by po, where po 
represents the point on S 1 at angle 8 € R, measured counterclockwise from 
the positive x-axis. Define / : [0, 2n) -> S l by f(8) = po, as illustrated in 
Figure 4.12. It is clear that / is a bijection. 



phism. 

Is / continuous? Yes. We can see this by looking at the preimage of 
an arbitrary basis element for the topology on S 1 . Such a basis element can 
be expressed in the form B = {pe e S l \ a < 8 < b and a, b e R}. We 
consider two cases for B: either po is in B or it is not, where po is the point 
corresponding to angle 6 = 0. If po & B, then f~ l (B) is an open interval 
(c, d) in [0, 2 n). If po € B, then f~ x ( B) is in the form [0, e) U (g, 2n). In 
either case, f~ x (B) is an open subset of [0, 2n) in the standard topology. It 
follows that / is continuous. 

But is f~ l continuous? No! The half-open interval [0, 1/4) is open in 
the domain [0, 27 t), but /([0, 1/4)) is not open in S l . Therefore / is not a 
homeomorphism. 
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In fact, there are no homeomorphisms between these two spaces, and there¬ 
fore they are not topologically equivalent. We will see how to show this when 
we establish some properties involving connectedness, in Chapter 6. (See 
Exercise 6.28.) 


EXAMPLE 4.14. In the standard topology on each, the plane is topologically 
equivalent to the open right half plane H = {(x, y) 6 R 2 | x > 0} and the 
open disk D = {(x, y) e R 2 | x 2 + y 2 < 1}, as illustrated in Figure 4.13. 



FIGURE 4.13: The plane is homeomorphic to the open half plane and the open 
disk. 

The function / : R 2 -» H, defined by /(x, y) — (e x , y ), is a homeomor- 
phism between R 2 and H. It maps R 2 to H, sending vertical lines to vertical 
lines, as follows: 

(i) The left half plane is mapped to the strip in H where 0 < x < 1. 

(ii) The y-axis is mapped to the line x = 1. 

(iii) The right half plane is mapped to the region in H where x > 1. 

The function g : R 2 -* D, defined (in polar coordinates) by g(r, 9) = 
(y^:, 6), is a homeomorphism between R 2 and D. It contracts the whole 
plane radially inward to coincide with the open disk D. 


EXAMPLE 4.15. The surface of a cube C is homeomorphic to the sphere 
S 2 , as illustrated in Figure 4.14. If we regard each as centered at the origin 
in 3-space, then the function / : C ->• S 2 , defined by f(p) = p/\p\, is a 
homeomorphism. As we see in the figure, / maps points in C bijectively 
to points in S 2 and maps the collection of open sets in C bijectively to the 
collection of open sets in S 2 . 
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FIGURE 4.14: The surface of a cube is homeomorphic to the sphere. 

_i 


In the next example, we see that removing a point from a sphere yields a 
space that is homeomorphic to the plane. 


EXAMPLE 4.16. In R 3 , let N — (0,0,1) be the point at the north pole in the 
sphere S 2 . Let X = S 2 — {fV}, and give X the subspace topology it inherits 
from R 3 . The space X is called a punctured sphere. We show that X is 
homeomorphic to the plane R 2 . Note that the xy-plane, as a subspace of R 3 , 
is naturally homeomorphic to R 2 , so we view R 2 as that subspace. 

We define a function / : X -» R 2 as follows. For each point p = (x,y,z) 
in X, take the ray that begins at N and passes through p. We define f(p) to be 
the unique point where the ray passes through the xy-plane. (See Figure 4.15.) 
The map / is called a stereographic projection. In Exercise 4.27 we ask you 
to derive a specific formula for /. 



FIGURE 4.15. Stereographic projection. 

It is clear from the illustration that / is a bijection and that / maps the 
collection of open sets in X bijectively to the collection of open sets in the 
plane. Hence, / is a homeomorphism from X to the plane. Thus, removing a 
point from the sphere results in a space that is topologically equivalent to the 
plane. 


In the next example we see how collapsing each end in a cylinder results 
in a space that is homeomorphic to the sphere. 
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EXAMPLE 4.17. Let X be the cylinder S 1 x [—1, 1] in 1R 3 . We create a 
quotient space X* by defining a partition of X that is made up of the following 
sets: 

(i) Each one-point set {(x, y, z)}, where z g (—1,1) and (x, y) € S 1 , 

(ii) The set T = {(x, y, 1) | (x, y) e S 1 } at the top of the cylinder, 

(iii) The set B = j(x, y, —1) | (x, y) e S 1 }, at the bottom of the 
cylinder. 

A quotient topology is defined on X* by the function p : X X* that 
takes each point in X and maps it to the set in X* containing it. 

In Figure 4.16 we illustrate X* and two example open sets in X*. In the 
identification from X to X*, the sets T and B are collapsed to individual points, 
T* and B*, respectively. An open set U in X* that does not contain T* or B* 
is essentially the same set as its corresponding open set p~ ] (U) in X. An open 
set V in X* that contains the point T*, but not the point B*, corresponds to an 
open set p -1 (V) in X that contains the subset T and is disjoint from B. We 
can determine other open sets in X* in a similar manner. 


X* 


FIGURE 4.16: The sets U and V are open sets in X*. 

We show that the quotient space X* is homeomorphic to the sphere S 2 . 
In 3-space, the cylinder surrounds the sphere, and both have the z-axis as a 
central vertical axis, as shown in Figure 4.17. Furthermore, the cylinder and 
sphere intersect in the circle of radius 1, centered at the origin in the xy-plane, 
and the north ( N ) and south (S) poles of the sphere are at the center of the top 
(T) and bottom (B) circles of the cylinder, respectively. 

To obtain a homeomorphism g : X* -* S 2 , we first define a function 
/ : X —> S 2 and then use / to define g. The function / is defined by projecting 
each point p in the cylinder X = 5 1 x [— 1,1 ] along a horizontal line, directly 
toward the z-axis, onto the point f(p ) on the sphere. We can express this 
function in cylindrical coordinates in M 3 as /(1, 6, z ) = (V1 — z 2 , 9, z). The 
function / maps the cylinder onto the sphere with the following properties, as 
illustrated in Figure 4.17: 

(i) The open subset of the cylinder, lying between the top and bottom 
circles, is mapped bijectively to the sphere minus its north and 
south poles. 
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(ii) The entire top circle of the cylinder, the set T, is mapped to the 
north pole, N. 

(iii) The entire bottom circle of the cylinder, the set B, is mapped to the 
south pole, S. 



FIGURE 4.17: The cylinder is mapped onto the sphere by /. 

Since / is constant on each of the circles T and B, and since each of these 
sets is considered a single element in X*, it follows that / induces a bijective 
function g : X* -* S 2 . 

To show that g is a homeomorphism, we check that g and g -1 map open 
sets to open sets. Let U be an open set in X*, and suppose it does not contain 
either T* or B* as points. Then g(U) is an open set in S 2 that is disjoint from 
both N and 5. (See Figure 4.18.) However, if V is an open set in X* that 
contains T*, then g maps V to an open set in S 2 that contains N. The other 
cases for open sets in X* are similar. Also, the same type of argument can be 
used to show that g” 1 maps open sets in S 2 to open sets in X*. It follows that 
g is a homeomorphism between X* and S 2 . 



FIGURE 4.18: The quotient space X* is homeomorphic to the sphere S 2 . 



EXAMPLE 4.18. In Figure 4.19 we see two bands in 3-space, each with the 
standard topology. Because we cannot perform a rubber-sheet deformation 
from one to the other, they might seem to be topologically distinct, but in fact 
they are homeomorphic. We define a map from the first band to the second 
by cutting the first band open along segment 5, untwisting the resulting strip, 
and then gluing the cut segments back together so that they match as they did 
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beforehand. The map takes a point on the first band, traces it through this 
process, and maps it to the point where it ends up on the second band. This 
map is a bijection on the bands and on the collections of open sets in the bands, 
and therefore is a homeomorphism. 



FIGURE 4.19: Two homeomorphic bands in 3-space. 


The band on the left in Figure 4.19 is not a Mobius band. It is made by 
putting a full twist in a strip before gluing the ends together, while a Mobius 
band is made by putting a half twist in a strip before gluing the ends together. 
(See Example 3.19.) 

Example 4.18 raises an important point about equivalence in topology. If 
we cut an object, we might be left with an object that is not topologically 
equivalent to the original. But, if we glue the cut back together properly, with 
points along either side of the cut matching as they did before the cut, then no 
matter how much twisting and deforming we do in the meantime, the result is 
homeomorphic to the original. 

Although homeomorphism represents the most fundamental equivalence 
between two topological spaces, it is not the equivalence most people envision 
when they think of topology as rubber-sheet geometry. As already indicated, 
the two bands appearing in Figure 4.19 cannot be deformed from one to the 
other as rubber sheets, but they are homeomorphic. The rubber-sheet notion 
of equivalence is made precise in Chapter 12 via what is known as isotopy, a 
continuous, parameterized family of homeomorphisms. 

There is a distinct difference in how the bands in Example 4.18 sit in 3- 
space. We can see this difference by examining the bands’ edges, the curves 
E\ and £2 shown in Figure 4.20. On the left side of the figure, the edges are 
linked, while on the right side they are not. This idea will be formalized in the 
concept of linking number, which we introduce in Chapter 12. 




e 2 



FIGURE 4.20: The edges E\ and E 2 are linked on the left, but not on the right. 
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The idea of one topological space sitting inside another is captured by the 
following definition: 

DEFINITION 4.15. An embedding of X in Y is a function f : X -*■ Y that 
maps X homeomorphically to the subspace f(X) in Y. 

We think of an embedding / : X -» Y as placing a copy of X in Y, as 
shown in Figure 4.21. 



FIGURE 4.21: Embedding X in Y. 

Each of the bands appearing in Figure 4.19 can be regarded as the image 
of an embedding of an annulus in 3-space. While the bands are homeomorphic 
as subspaces of R 3 , we will see that file corresponding embeddings are distinct 
once we define a notion of equivalence between embeddings, in Chapter 12. 

DEFINITION 4.16. Let X be a topological space. If f : [—1, 1] —>• X is an 
embedding, then the image of f is called an arc in X, and if f : S 1 —>■ X is 
an embedding, then the image of f is called a simple closed curve in X: (See 
Figure 4.22.) 



In Figure 4.23, we depict some simple closed curves in 3-space. Em¬ 
beddings / : S 1 —► R 3 , naturally enough, are referred to as knots and are 
the subject of the study of knot theory. We discuss embeddings further in 
Chapter 11 and knot theory in Chapter 12. 

Certain properties defined for topological spaces are preserved by homeo¬ 
morphisms. An example is provided by the next theorem. 
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THEOREM 4.17. Iff : X —> Y is a homeomorphism and X is Hausdorjf, 
then Y is Hausdorjf. 


Proof. Suppose that X is Hausdorff and / : X -*■ Y is a homeomor¬ 
phism. Let x and y be distinct points in Y. Then f~ x (x) and f~ l (y) 
are distinct points in X. Thus, there exist disjoint open sets U and V 
containing / -1 ( x) and f~ l (y), respectively. It follows that f(U) and 
f(V) are disjoint open sets containing x and y, respectively. Therefore 
Y is Hausdorff. ■ 


Theorem 4.17 implies that R in the standard topology is not homeomorphic 
to R in the finite complement topology because the former is Hausdorff while 
the latter is not. 

A property of topological spaces that is preserved by homeomorphism is 
said to be a topological property. Theorem 4.17 implies that being Hausdorff 
is a topological property. In general, properties that are defined in terms of 
open sets, like Hausdorff, are topological properties. 

As we have seen in a few examples in this section, we can prove that two 
topological spaces are homeomorphic by defining a homeomorphism between 
them. Conversely, in order to prove that two topological spaces are not home¬ 
omorphic, it is necessary to show that none of the functions defined between 
them is a homeomorphism. However, it is often too difficult to consider every 
function between the spaces. Instead, as in our argument that R in the standard 
topology is not homeomorphic to R in the finite complement topology, we find 
a topological property held by one space but not the other. Then we know that 
the two spaces are not homeomorphic. 

Exercises for Section 4.2 

4.22. Consider all of the possible topologies on the two-point set X = {a,b}. Indicate 
which ones are homeomorphic. 

4.23. Find three different topologies on the three-point set X = {a, b, c}, each con¬ 
sisting of five open sets (including X and 0), such that two of the topologies 
are homeomorphic to each other, but the third is not homeomorphic to the other 
two. 

4.24. Prove that a bijection f : X -* Y is a homeomorphism if and only if / and 

map closed sets to closed sets. 
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4.25. (a) Provide a formula for a homeomorphism between R and the interval 

(—oo, a). 

(b) Provide a formula for a homeomorphism between R and the interval (a , b) , 
with a < b. 

(c) Given the homeomorphisms in Example 4.12 and the first two parts of this 
exercise, prove that if I\ and h are intervals in collection (i) in Exam¬ 
ple 4.12, then I\ and h are topologically equivalent. 

4.26. (a) Provide a formula for a homeomorphism between the intervals [0, oo) and 

[a, b ), with a < b. 

(b) Provide a formula for a homeomorphism between the intervals (—oo, 0] 
and ( a , b\, with a < b. 

(c) Given the homeomorphisms in Example 4.12 and the first two parts of 
this exercise, prove that if I\ and h are intervals in collection (iii) in 
Example 4.12, then I\ and I 2 are topologically equivalent. 

4.27. Provide an explicit formula for the stereographic projection function in Exam¬ 
ple 4.16. 

4.28. Prove each of the following statements, and then use them to show that topolog¬ 
ical equivalence is an equivalence relation on the collection of all topological 
spaces: 

(a) The function id : X -*■ X, defined by id(x) = x, is a homeomorphism. 

(b) If / : X -»■ Y is a homeomorphism, then so is /“’ : Y -» X. 

(c) If / : X -*■ Y and g : Y -» Z are homeomorphisms, then so is the 
composition g o / : X -*■ Z. 

4.29. Show that R 2 - {O} in the standard topology is homeomorphic to S 1 x R. 

4.30. Find two distinct topologies on R such that the first is strictly finer than the 
second but the two of them are homeomorphic with each other. 

4.31. (a) Show that having nonempty open sets that contain finitely many points is 

a topological property. 

(b) Prove that the digital line is not homeomorphic to Z with the finite com¬ 
plement topology. 

4.32. Show that homeomorphism preserves interior, closure, and boundary as indi¬ 
cated in the following implications: 

(a) If / : X —► Y is a homeomorphism, then /(Int(A)) = Int(/(A)) for 
every A cl 

(b) If f : X -*■ Y is a homeomorphism, then /(C1(A)) = C1(/(A)) for every 
A c X. 

(c) If / : X —> Y is a homeomorphism, then /(3(A)) = 3(/(A)) for every 
A c X. 

4.33. Let X x Y be partitioned into subsets of the form X x {y} for all y in Y. If we 
let (X x Y)* denote the collection of sets in the partition, show that (X x Y)* 
with the resulting quotient topology is homeomorphic to Y. 

4.34. Let X, Y , and Z be topological spaces. Prove that the three product spaces 
(X x T) x Z, X x (Y x Z), and X x Y x Z are homeomorphic to each other. 

4.35. In Chapter 3 we introduced three different representations of the torus: 

(i) The definition of the torus as a subspace of R 3 in Example 3.5, 

(ii) The product space S l x S 1 in Example 3.10, and 
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(iii) A quotient space obtained by gluing edges of a square in Exam¬ 
ple 3.20. 

Verify that these spaces are topologically equivalent to each other by defining 
homeomorphisms between them. 

4.36. In this exercise we establish that products of homeomorphic spaces are home- 
omorphic. Let f : X —* Y and g : X' -*■ Y' be homeomorphisms. Prove that 
the function h : X x X' -*■ Y x Y', defined by h{x, x') = (f(x), g(x')), is a 
homeomorphism. 

4.37. Begin with a collection of topological graphs representing the letters of the 
alphabet. (See Exercise 3.34.) By visual inspection, determine which topolog¬ 
ical graphs in the collection are homeomorphic and which are not, then group 
them into equivalence classes by topological equivalence. (Note: In Chapter 6 
we introduce properties that enable us to verify that topological graphs from 
the different equivalence classes in this exercise are not homeomorphic—see 
Exercise 6.31.) 


4.3 The Forward Kinematics Map in Robotics 

In the field of robotics, there is a naturally defined continuous function, called 
the forward kinematics map, which plays a role in motion planning for link¬ 
ages, robot arms, and other similar mechanisms. In this section we present an 
introduction to the forward kinematics map by considering a few examples and 
examining some interesting properties and problems involving this function. 

Recall the linkage presented in Example 3.29, where we had two rods, 
pictured again in Figure 4.24. We assume throughout this section that rod B is 
shorter than rod A. The configuration space for the system is the torus, S 1 x S 1 , 
where the first S 1 corresponds to the circle of angles 6a through which rod A 
can turn about its fixed end, and the second S 1 corresponds to the circle of 
angles 6b through which rod B can turn about its end that is fixed to rod A. 



FIGURE 4.24: A two-rod linkage. 

The operational space for the linkage is the space traced out by the end 
of rod B. In this particular situation, we can picture the linkage as a drawing 
device, and the operational space as the set of all points we can color with a 
pen at the end of rod B. The resulting operational space is an annulus. (See 
Figure 3.40.) 
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In general, in studying the design of a machine or robot arm, we are inter¬ 
ested in a particular point on the mechanism where there is a tool that serves a 
specific function, such as spraying paint, picking up a part, drilling a hole, and 
so on. This point is called the end effector, and the operational space is the 
space traced out by the end effector. For our purposes, it suffices to view the 
end effector as a point where there is a pen on a drawing device. 

To each point in the configuration space of a mechanism, we associate the 
corresponding end-effector point in the operational space. Thus, a function / 
is defined; it is called the forward kinematics map for the mechanism. It is 
natural to assume that the forward kinematics map is continuous because points 
close together in the configuration space correspond to points close together 
in the operational space. In the case of our two-rod linkage, / is a continuous 
map from the torus to the annulus, as shown in Figure 4.25. 



FIGURE 4.25: The forward kinematics map for the two-rod linkage. 

An important question in the field of robotics is whether a given path in 
the operational space of a mechanism can be traced by the end effector. In our 
example, we can ask if there is a way to manipulate the configuration space 
variables Oa and 6b to yield a given path in the operational space annulus. 
(See Figure 4.26.) In other words, can the linkage draw a given curve in the 
annulus? 



FIGURE 4.26: Can the linkage trace out the given path? 

Let us make this idea more mathematically precise. 

DEFINITION 4 . 18 . Let Y be a topological space, and let [0, 1] C M have the 
standard topology. A path in Y is a continuous function p : [0,1] ->• Y. We 
say that the path begins at p{ 0) and ends at p( 1). (See Figure 4.27.) 
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FIGURE 4.27: A path in Y. 


We can think of a path as a trajectory in Y, traced from time 0 to time 1. 
While a path is a function, we often use the term path to refer to the image of 
a path. 

Suppose that we have a linkage with configuration space C, operational 
space O, and forward kinematics map f : C -*■ O. Given an operational 
space path p : [0,1] —> O, our path-tracing question now asks: Is there 
a configuration space path q : [0,1] —► C such that p = f o ql (See 
Figure 4.28.) If such a q exists, we say that q traces p. 



0 1 


FIGURE 4.28: The path q traces path p if p = f oq. 

To say that q traces p means that there is a way to move the linkage, 
specified by the path q in the configuration space, to obtain the desired path 
p in the operational space. The problem of finding a configuration space 
path corresponding to a given operational space path is known as an inverse 
kinematics problem. 


EXAMPLE 4 . 19 . Consider the two-rod linkage shown in Figure 4.24. We 
represent the configuration space torus as a square with its edges identified as 
in Section 3.3. The horizontal direction on the square corresponds to angle 6 a, 
and the vertical direction corresponds to 9b- 

Let p be a circular path in the operational space, as shown in Figure 4.29. 
Then p can be traced by a configuration space path q with 6a fixed at 6* and 
9g running from 0 to 27r. 
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FIGURE 4.29: Configuration space path q traces out operational space path p. 


Now let p be a circular path around the center of the operational space 
annulus, as shown in Figure 4.30. In this case, p is traced by a configuration 
space path q with 9b fixed at 9* and 9a running from 0 to 27r. 



FIGURE 4.30: Configuration space path q traces operational space path p. 

Consider the two operational space paths shown in Figure 4.31. Path p\ is 
a circular path that is tangent to the inner and outer circles of the annulus. Path 
P 2 follows two circular arcs that have a radius equal to the length of rod B. 
In Exercise 4.38 we ask you to find configuration space paths that trace these 
operational space paths. 



FIGURE 4.31: Are there configuration space paths that trace the paths shown? 
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While it might be possible to find a configuration space path that enables a 
mechanism to follow a specified path in the operational space, sometimes the 
process results in problematic configurations of the mechanism. In the next 
example, we see such a situation and how it impacts spaceflight navigational 
systems. 


EXAMPLE 4.20. Consider the mechanism illustrated in Figure 4.32. Sup¬ 
pose that the components of the mechanism can rotate as indicated in the fig¬ 
ure. The rotating components are called gimbals, and since each one can rotate 
through a full circle, the configuration space of the mechanism is S 1 x S 1 x S 1 , 
the 3-torus. 



FIGURE 4.32: A three-gimbal mechanism. 

The operational space is the space traced out by the tip of the pointer on 
the inner gimbal. The other end of the pointer lies on all three axes of rotation 
and therefore is fixed. Furthermore, it'is easy to see that the tip of the pointer 
can be oriented in any direction in 3-space, and therefore the operational space 
is a sphere. 

While the mechanism allows the arrow to point in any direction, there are 
configurations for which freedom of motion is restricted. These configurations 
of the mechanism give rise to the phenomenon known as gimbal lock and 
occur when the three gimbals are coplanar, as shown in Figure 4.33. With 
the gimbals thus configured, rotation of either the outer gimbal or the inner 
gimbal results in the arrow tracing a path on the equator of the operational 
space sphere. In contrast, rotation of the middle gimbal keeps the arrow fixed 
on the same point x. It follows that every smooth motion of the mechanism 
through this configuration results in a path that is tangent to the pictured equator 
at jc. Consequently, if the mechanism passes through this configuration and 
generates a path that is not tangent to the equator, then the mechanism must 
come to rest at this configuration. The configuration space point associated 
with such a configuration is called a singular point of the forward kinematics 
map. 

We do not provide a formal definition of singular points of the forward 
kinematics map since it requires the development of concepts from the fields 
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FIGURE 4.33: The gimbal-lock configuration. 

of differential geometry and differential topology. It is important to realize, 
though, that singular points are points in configuration space corresponding to 
configurations of the mechanism, rather than points in the operational space. 
In this example, if the gimbals are configured as in Figure 4.32, then the 
corresponding point in operational space is the same point x as in the gimbal- 
lock configuration depicted in Figure 4.33. The configuration in Figure 4.32 
is not singular and poses no restrictions on tracing paths through x. 

The configuration associated with gimbal lock posed a navigational chal¬ 
lenge during the National Aeronautics and Space Administration’s manned 
spaceflights to the moon in the late 1960s and early 1970s. Each spacecraft’s 
navigational system included a gimballed mechanism similar to the one shown 
in Figure 4.32. We present a simplified overview of that system, but the prob¬ 
lems caused by singular points in the configuration space are essentially the 
same. 

The navigational system’s inertial measurement unit (IMU) was attached 
to the spacecraft and contained a system of three gimbals in the arrangement 
shown in Figure 4.34. The inner gimbal held the stable platform, the main 
functional component of the IMU. Within the stable platform there were elec¬ 
trical and mechanical systems that served two important purposes. First, they 



FIGURE 4.34: The structure of the inertial measurement unit (IMU). 
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defined a reference frame for navigation in space. Second, as long as gimbal 
lock was avoided, they kept the stable platform in a fixed orientation when the 
spacecraft maneuvered, thereby keeping the reference frame constant. 

Gimbal lock occurs when the gimballed device is in the singular position 
shown in Figure 4.35. In this position, if the spacecraft turns on an axis 
perpendicular to the plane of the gimbals, then the three gimbals turn together in 
their plane, maintaining the gimbal lock configuration and rotating the stable 
platform so that the reference frame is lost. In fact, at gimbal lock every 
spacecraft motion including such a rotational component causes the reference 
frame to be lost. 



FIGURE 4.35: The reference frame can be lost as a result of a rotation in the 
gimbal-lock configuration. 

Gimbal lock was such a concern on these spaceflights, that a warning 
indicator was used to alert the astronauts when gimbal lock was imminent. 
The astronauts would then maneuver the spacecraft to enable the IMU to move 
away from gimbal lock, thereby averting the time-intensive task of resetting 
the reference frame. 


As is evident in the examples in this section, an analysis of the forward 
kinematics map is important in the design of mechanical systems. The forward 
kinematics map can provide critical information about impossible, restrictive, 
or potentially problematic configurations of a mechanism under consideration. 

Exercises for Section 4.3 

4 . 38 . Consider the two-rod linkage in Figure 4.24 and the two operational space paths 
shown in Figure 4.31. In diagrams of the configuration space torus, sketch paths 
q\ and qi that trace paths p\ and p 2 , respectively. 

4 . 39 . Consider the two-rod linkage in Figure 4.24. A configuration (0 a, 0b) of this 
linkage is a singular configuration if at configuration (0a, 0b) the operational 
space path obtained by holding 0a fixed and varying 0b is tangent to the path 
obtained by holding 0b fixed and varying 0a- Determine the singular configu¬ 
rations of this linkage. 
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4.40. Consider the linkage, called Watt’s Parallel Motion, illustrated in Figure 4.36(i). 
Each rod A and B has one end pinned in the plane and the other end connected to 
an end of rod C. The rods can rotate at each pinned point and at each connection 
point. Imagine we have a pen at the midpoint of rod C. As illustrated, 8 a is the 
angle that rod A makes, measured counterclockwise from the horizontal axis; 
the same holds for 8 b and B . The configuration space is the set of pairs of angles 
(6A . &b) corresponding to possible configurations of the linkage; it is illustrated 
as a subspace of the 8a&b -plane in Figure 4.36(ii). The operational space is the 
subspace of the plane traced out by the pen; it is illustrated in Figure 4.36(iii). 
You might want to make a working model of the linkage to see how it operates. 



FIGURE 4.36: Watt’s Parallel Motion, its configuration space, and its operational 
space. 

(a) Consider the set of configuration space points (8a, 8b). Explain and il¬ 
lustrate why there are two values of 8a that have only one corresponding 
value of 8b . Explain and illustrate why all other values of 8a have two 
corresponding values of &b . 

(b) Show the behavior of the forward kinematics map for Watt’s Parallel Mo¬ 
tion by picking a variety of points on the configuration space diagram 
and demonstrating where they map on the operational space figure eight. 
Among your points, include the two points in the configuration space that 
map to the crossing point in the figure eight. 






Chapter 5 

Metric Spaces 


One of the most common and useful types of topological space is the so-called 
metric space. Metric spaces are topological spaces that result from having a 
means for measuring distance between points in the underlying set. This notion 
of measuring distance goes beyond stretching out a measuring tape to see how 
far apart two objects are. For example, as we will see, we can measure the 
distance between two functions by considering the area bounded between their 
graphs, and we can measure the distance between two words by considering 
how many letter changes take us from one to the other. The ability to measure 
and compare distances between elements of a set is often crucial, and it provides 
more structure than a general topological space possesses. 

Metric spaces play a major role in the mathematical field of analysis, and 
they appear in a variety of interesting applications. In Section 5.2, we present 
applications to error-correcting codes and DNA sequences. 

First, we begin the chapter with an introduction to metrics, in Section 5.1. 
After .discussing the applications in Section 5.2, we examine properties of 
metric spaces in Section 5.3 and introduce the concept of metrizability in 
Section 5.4. 


5.1 Metrics 

DEFINITION 5.1. A metric on a set X is a function d : X x X ->■ R with the 
following properties: 

(i) d(x, y) > 0 for all x, y e X; equality holds if and only ifx = y. 

(ii) d(x, y) = d(y, x) for all x , y € X. 

(iii) d(x, y) + d(y, z) > d(x,z) for all x,y,z € X (the triangle in¬ 
equality). 

We call d(x,y) the distance between x and y, and we call the pair (X, d), 
consisting of the set X and the metric d, a metric space. 

Note that d has the properties we expect when we measure distance between 
points. The distance between two points is at least 0, and it equals 0 only when 
the two points are the same. The distance from point x to point y is the same 
as the distance from point y to point x. Finally, the distance to travel from x 
to y and then y to z is never shorter than the distance to travel directly from x 
to z. 
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EXAMPLE 5.1. On M, define d(x, y) = \x — y|. This is called the Eu¬ 
clidean metric or standard metric on ®. Conditions (i) and (ii) for a metric 
are immediate. The triangle inequality can be easily verified by considering 
separately each of the orderings of the three points x, y and z. For instance, in 
the case that y < x < z, we have 


d(x, y) + d(y, z) = (x - y) + (z - y) 

> z - y 

> z - x 
- d(x,z). 


EXAMPLE 5.2. We introduce three different metrics defined on the plane 
M 2 . In Figure 5.1, we illustrate how each measures the distance between two 
points p = (pi , p 2 ) and q = (q i, q 2 ). 

In Section 0.4 we defined 

d(p, q) = i/(pi - q \) 2 + (pi - qi ) 2 

as the Euclidean distance formula. There we indicated that d satisfies the three 
properties that make it a metric on the plane. We call d the Euclidean metric 
or standard metric on K 2 . This metric measures the straight-line distance 
between points in the plane. 

Next, define dr(p,q) = Ipi — q\\ + \P2 — qi\. It is straightforward 
to show that dj satisfies the properties of a metric. (See Exercise 5.1.) This 
metric is called the taxicab metric or the Manhattan metric, since it measures 
the total distance traveled vertically together with the total distance traveled 
horizontally, as if you were restricted to travel on a city grid of parallel and 
perpendicular streets running North-South and East-West. 

Finally, define d^ip, q) = max{|/?i — q\\, \p 2 — q 2 \}- The function du 
is a metric. (See Exercise 5.2.) It is called the max metric. In this case, the 
distance between two points is the maximum of the differences between their 
coordinates. 



FIGURE 5.1: Measuring distance in the Euclidean, taxicab, and max metrics. 

_i 
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Earlier, when we presented the standard topology on the plane, we defined 
it via a basis of open balls using the Euclidean distance formula. The proof 
that the collection of open balls is a basis depended only on the fact that the 
Euclidean distance formula satisfies the properties of a metric. Therefore, as 
we show next, given a metric on a set, we can define a topology on the set via 
open balls that are determined by the metric. 

DEFINITION 5.2. Let {X, d) be a metric space. For x s X and e > 0 define 
the open ball of radius e centered at x to be the set 

B d (x, e) = {y € X \ d(x, y) < e}, 

and define the closed ball of radius s centered at x to be the set 

B d (x, e) = {y e X \ d(x, y) < e). 

THEOREM 5.3. Let ( X , d) be a metric space. The collection of open balls, 
B = {B d (x, e) | x e X, s > 0}, is a basis for a topology on X. 

Before we prove Theorem 5.3, we need the following lemma that shows 
that, if a point y lies in some open ball B d (x, e), then an open ball centered at 
y lies in B d (x, e) as well: 


LEMMA 5.4. Let(X, d) be a metric space. Ifx 6 X, e > 0, and y e B d (x, e), 
then there exists 5 > 0 such that B d (y, S ) C B d (x, e). 

Proof. Set S equal to e — d(x, y), as illustrated in Figure 5.2. We claim 
that B d (y, S) c B d (x, e). 



FIGURE 5.2: Set 5 equal to e - d(x, y). 

To prove the claim, let z € B d (y, S ) be arbitrary; then d(y, z) < 8. 
Hence, 


d(x, y) + d(y, z) < d(x, y) + 8 

< d{x, y) + (e - d(x, y)) 

= e. 

Thus d(x, z) < £, implying that z e B d (x, e). Therefore B d (y, 8) C 
B d (x,e). ■ 
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Proof of Theorem 5.3. We need to check that B is indeed a basis. It is 
certainly true that every point x in X is contained in a set in B. In fact, 
x € Bd(x, e ) for every £ > 0. 

To see that the second condition for a basis is satisfied, we must show 
that if x € B\ fl B 2 , and B \, B 2 e B, then there exists B 2 e B such that 
x € B 2 c B\ n B 2 . 

Let B\ and B 2 be two sets in B, and suppose that x e B\ fl B 2 . 
Then by Lemma 5.4 there exist , S 2 > 0 such that B d (x, 8\) C B\ and 
Bd(x, S 2 ) C B 2 . Let 5 = min{<Si, <$ 2 }- Then x € B d (x, 8) C B\ fl B 2 , 
as desired. 

It follows that B is a basis for a topology on X. ■ 

Since the collection of open balls associated to a metric is a basis, we 
immediately obtain a topology: 

DEFINITION 5.5. Let ( X , d) be a metric space. The topology generated by 
the basis of open balls B = {B d (x, e) \ x e X, s > 0} is called the topology 
induced by d and is referred to as a metric topology. 

From now on, when we refer to a metric space ( X , d), we assume it is a 
topological space with the metric topology induced by d. 

The following theorem provides a useful condition for a set to be an open 
set in a metric topology: 

THEOREM 5.6. Let ( X , d) be a metric space. A set U C X is open in the 
topology induced by d if and only if for each y € U, there is a 8 >0 such that 
B d (y , 5) c U. (See Figure 5.3.) 

Proof. See Exercise 5.9. ■ 



FIGURE 5.3: A set U is open if and only if for each y in U there is an open ball 
centered at y and contained in U. 

In a metric space the open balls are open sets since they are basis elements 
for the topology. It is straightforward to show that the closed balls are closed 
sets. (See Exercise 5.14(a).) Furthermore, justifying the notation B d for closed 
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balls, it follows that the closed ball B d (x, e ) is the closure of the open ball 
B d (x, £•). (See Exercise 5.14(b).) 


EXAMPLE 5.3. Consider the metric on R given by d(x,y) = |x — y\. We 
see that the basis elements associated with the metric d are the open intervals 

B d (x, e) — {y e R | \x — y| < e} = (x — s, x + e). 

Since every open interval (a, b ) in the real line can be expressed in the form 
(x — s, x + e) by setting x = and e = it follows that this basis 
is exactly the basis for the standard topology on R. Therefore, the topology 
induced by the standard metric on R is the standard topology. 


EXAMPLE 5.4. Consider again the three metrics on the plane R 2 introduced 
in Example 5.2. In Figure 5.4 we illustrate example open balls associated with 
each of the metrics. 



FIGURE 5.4: Open balls in the standard, taxicab, and max metrics. 


In the standard metric given by 

d(p, q) = y](p\ - q \) 2 + ( P2 - qi) 2 , 

the resulting basis elements are the usual round open balls in the plane. In 
Example 1.11 we defined the topology resulting from this basis as the standard 
topology on R 2 . So the standard topology on R 2 is the topology induced by 
the standard metric d. 

Next, consider the taxicab metric defined by 

drip, q) = \Pi~q\\ + \P2 ~ qil 

The resulting basis elements are of the form 

B dT (p,E) = [q 6 R 2 | \p\ -q\\ + \pi~qi\ < e}. 

Here, the open ball B dr (p, s) is an open diamond, centered at p, with distance 
s from p to the corners. Using Theorem 1.13, it can be shown that this metric 
also induces the standard topology on R 2 . In Section 5.3, we will introduce 
a general result (Theorem 5.15) that allows for straightforward comparison of 
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metric topologies; we use it there to prove that the taxicab metric induces the 
standard topology on R 2 . 

Finally, consider the max metric defined by 

d M (p, q ) = max{|/?i - q\ |, \p 2 - qi\). 

An open ball Bd u ( p , e) in this metric is an open square in the plane, centered 
at p, with edge length equal to 2s. Here too. Theorem 1.13 or Theorem 5.15 
can be used to show that the max metric induces the standard topology on R 2 . 
(See Exercise 5.26.) 


As with the plane R 2 , the Euclidean distance formula on R” defines a metric 
(called the Euclidean metric or the standard metric) and the resulting metric 
topology is the standard topology on R” that was introduced in Section 1.2. 


i 


EXAMPLE 5.5. Let C[a, b] be the set of continuous functions f:[a, b] -> R. 
Given two such functions / and g, define 

P(f,g)=[ \f(x)-g(x)\dx. 

Ja 

This function measures the area between the graphs of / and g from x = a to 
x = b. (See Figure 5.5.) 



FIGURE 5.5: The distance between / and g is the area between their graphs. 

It is straightforward to show that p satisfies the properties of a metric. (See 
Exercise 5.7.) 

How do we know p(f,g) is defined at all, in particular that the integral in the 
definition is finite? It turns out that topology provides an answer. Since / and g 
are continuous functions, so is the integrand \f — g| in the definition of p(f, g). 
In Chapter 7, which introduces the topological property of compactness, we 
will see that on a compact domain (for example, a closed and bounded interval 
in R) a continuous real-valued function has a maximum and a minimum value. 
Therefore, if M is the maximum value for \f — g| over [a, b] then 

0< f \f(x) — g(x)\ dx < M(b — a), 

Ja 

and it follows that p(f, g) is finite. 
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The topological space consisting of C[a, b ] with the topology induced by 
p is an example of a function space. In the field of mathematical analysis, 
function spaces are used extensively in the study of mappings, called operators, 
that send functions to functions. 


Exercises for Section 5.1 


5.1. Show that the taxicab metric on R 2 satisfies the properties of a metric. 

5.2. (a) Show that the max metric on R 2 satisfies the properties of a metric. 

(b) Explain why d(p, q ) = min{|pj — q\ |, \p 2 — 42 II does not define a metric 
on R 2 . 

5.3. For points p = (pi, p 2 ) and q — (q\, q 2 ) in R 2 define 


dvip.q) 


f 1 if Pi #4i °r \p 2 — 42 1 > 1, 

1 \P 2 ~ 421 if P\ = 41 and \p 2 - 421 < 1. 


(a) Show that dy is a metric. 

(b) Describe the open balls in the metric dy . 

5.4. For points p and q in the circle S 1 , define d(p,q) to equal the minimum 
nonnegative angle (in radians) needed to rotate the circle so that the points p 
and q coincide. Prove that d is a metric on S 1 . 

5.5. Let X be a nonempty set. Define d on X x X by 


Show that d is a metric, and determine the topology on X induced by d. 

5.6. Let d be a metric on a finite set X. Prove that the topology on X induced by d 
is the discrete topology. 

5.7. Consider p as defined in Example 5.5. 

(a) Use the basic properties of integrals to prove that p is a metric. 

(b) Explain why we cannot generally define p(f, g) for functions / and g that 
are continuous on the open interval (a, b). 

5.8. On C[a, b], the set of continuous functions / : [a, b] —» R, define Pm(J, g ) = 
max XS [ a ,fe]{|/(x) — g(jc)|}. Assuming that such a maximum always exists 
(something we prove in Chapter 7), prove that pm is a metric. 

5.9. Prove Theorem 5.6: Let ( X , d) be a metric space. A set U c X is open in the 
topology induced by d if and only if for each y e U, there is a 5 >0 such that 
Bd(y,S) c u. 

5.10. (a) Let (X, d) be a metric on a space. For x, y e X, define 


D{x, y) 


d(x, y) 

1 +d(x, y) 


Show that D is also a metric on X. 

(b) Explain why no two points in X are distance one or more apart in the 
metric D. 
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5.11. On the set of integers Z, show that the function d, defined as follows, is a metric: 


d(x, y) 


| 0 if x = y, 

[ min{ £ | n\ divides \x — y|} if x # y. 


5.12. Let S be the set of sequences of Os and Is. For x 
y = (yi, y2, ^3, - - -) define 


d(x, y) 


OO 


E 


\*j-yj\ 

v 


(x],X 2 ,X 3 ,...) and 


(a) Explain why the infinite sum in the definition of d(x, y) converges for all 
x and y. 

(b) Prove that d(x, y) is a metric. 

(c) Let E be the subset of S consisting of all sequences that are eventually 0. 
Thus, x = Oq, X 2 , x^,.. .) is in E if there exists N >0 such that x„ = 0 
for all n > N. Prove that E is dense in S under the topology induced by 
d. 

5.13. Let (X, d) be a metric space. Prove that the distance function d : X x X -» M 
is continuous, assuming that X x X has the product topology that results from 
each copy of X having the topology induced by d. 

5.14. Let (X, d) be a metric space. 

(a) Show that the closed balls in the metric d are closed sets in the topology 
on X induced by d. 

(b) Show that for e > 0 and x € X, the closed ball B t /(x, e) is the closure of 
the open ball Bj(x,e). 

5.15. Let (X, d) be a metric space and assume that A c X. Prove that x e C1(A) if 
and only if there exists a sequence in A converging to x. 


5.2 Metrics and Information 

Metric spaces are used in numerous applications involving the storage, manip¬ 
ulation, and presentation of information. Strings of symbols, like the letters 
making up the words you read here, are the basic information units. In any 
situation where we wish to measure the similarities and differences between 
information units, an appropriate metric can be found to do so. In this section 
we look at two particular examples. In the first, the information units are trans¬ 
mitted binary codes, and in the second the information units are the sequences 
of letters modeling nucleotide strands in a DNA molecule. 

Error-Correcting Codes 

With the incredible amounts of information being transmitted over phone lines, 
through the internet, or from satellites in space to Earth, it is extremely im¬ 
portant to know whether a given message has arrived intact. We expect that 
there will be some errors in transmission due to electrical surges, cosmic ra¬ 
diation, or a variety of other factors. We want to be able to recognize when 
this occurs and to correct the faulty message. This brings us to the theory of 
error-correcting codes. 
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Suppose that we want to send a certain message. We assume that the 
message has been encoded in a binary code, which is to say that our message 
consists of a finite sequence of Os and Is, say n of them, which we call a word. 
We also assume that in transmission, some number of Os may be turned into 
Is and vice versa. We do not allow for lost entries or additional entries, so the 
word that arrives also has length n. Note that what we call a word here could 
consist of a few words, as we normally think of them, making up a particular 
message. 

Each word of length n can be thought of as a vector of length n, with all 
entries either Os or Is. We write the set of all these possibilities as V" = 
{(ai,... , a n ) | ai € {0,1}}. So V n is the product of n copies of the set {0,1}. 
We now put a metric on this set. 

DEFINITION 5.7. The Hamming distance Dh(x, y) between two words of 
length n is the number of places in which the words differ. 

For example, given 


x = (0,0, 1,1, 0,0, 0,1,0), 

y = ( 0 , 1 , 0 , 1 , 0 , 0 , 1 , 1 , 0 ), 

we find that x and y differ in the second, third and seventh places, and therefore 
Dh(x, y) = 3. 

The Hamming distance is a metric on V n . (See Exercise 5.16.) Since V n 
is a finite set, the topology that the Hamming distance induces is the discrete 
topology. (See Exercise 5.6.) 

In reality, we do not use all of the elements of V n to send messages, any 
more than we use eveiy possible string of n letters as a word in the English 
language. We take a subset of the words in V n , and from that we pick the 
words we transmit. 

DEFINITION 5.8. A code of length n is any subset C of V". We call the 
elements of C the codewords. 

If the sender and receiver have agreed on a particular code, then when a 
word arrives that is not one of the codewords, the receiver knows that at least 
one error has occurred in transmission. 

DEFINITION 5.9. Let C be a code of length n. Define the minimum distance 
of the code C to be the least Hamming distance between two codewords in the 
code. 


EXAMPLE 5.6. Consider the code of length 6 given by 

C = {(0, 0, 1, 0, 0, 0), (1, 0, 0, 1, 1, 1), (1, 1, 1, 0, 1, 1), (0, 1,0,0, 1,0)}. 

You can check that the minimum distance between these codewords is 3. 

If we receive a message that we know has at most one error, meaning one 
change in one of its entries, then we can tell which codeword it is supposed to 
be. There is no word in V" that is within one error of two different codewords 



170 Chapter 5 Metric Spaces 


in C. If there was, then those two codewords would be within a distance of 2 
of one another by the triangle inequality, but that contradicts the fact that the 
minimum distance between any two codewords is 3. 


This demonstrates the basic notion of error-correcting codes. We place an 
open ball of integer radius r around each codeword. If a word is sent to us and 
r — 1 or fewer errors occur in the transmission, then the word we receive lies in 
the open ball of radius r centered about the codeword that was originally sent, 
though it may lie in open balls of radius r centered around other codewords. 
However, if the open balls of radius r about the different codewords do not 
overlap, then the word we receive must be in a unique open ball, and we know 
exactly which codeword was sent. We can correct the errors and determine the 
intended word. The following theorem makes this rigorous. Let [x\ denote 
the largest integer less than or equal to x. 


THEOREM 5.10. If a code C of length n is chosen so that its minimum 
distance is d, then every message of length n with J or fewer errors can 
be corrected. 

Proof. Suppose c is the original codeword that was sent and / is the 
word that arrives. Since / has no more than errors, we know that 

c) < . Suppose that there is a second codeword c' such 

that D//(/, c') < . Then we find that 

D h (c,c') < 

< 


a contradiction to our minimum distance of d. Hence, c is the only 
codeword that is within J of /, and therefore must be the codeword 
that was sent. ■ 


D H (c, f) + D H (f, c') 



d- 1, 


There are two useful attributes we would like a code to possess. First, 
we want it to allow us to correct a large number of errors. This means that 
we do not want to pick too many codewords, since we would like there to be 
large open balls that are disjoint from each other, each centered at a codeword. 
Second, we would like to have enough codewords to enable the sending of a 
variety of different messages. For example, the code C = {(0, 0,0,0,0)} lets 
us correct up to five errors, but it does not allow the transmission of useful 
information since it is the only word we can send. Ultimately, we want to 
maximize the number of disjoint open balls of a given size that we can fit in 
V”. (See Exercise 5.19, for example.) 

Detecting and correcting errors are active areas of research in the fields 
of computer science and information science. Metrics, such as the Hamming 
distance, naturally play a key role in this work. 
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DNA Sequences 

DNA is a long thin molecule made up of millions of atoms. Within its struc¬ 
ture lies the code that determines our genetic makeup. Like RNA (introduced 
in Section 1.4), DNA is composed of nucleotides. While an RNA molecule 
consists of a single chain of nucleotides, a DNA molecule consists of two 
chains wound together to form the familiar double helix, as illustrated in Fig¬ 
ure 5.6. The nucleotides in DNA come in four types: adenine (A), cytosine 
(C), guanine (G), and thymine (T). In our introduction to RNA we showed that 
nucleotides in a chain tend to pair together, contorting the chain into a folded 
shape. Nucleotides in a DNA chain also pair, but do so with their neighbors on 
the opposite chain (adenine pairs with thymine, and guanine pairs with cyto¬ 
sine). In fact, the two chains are constructed such that every nucleotide on one 
chain pairs with its neighbor on the other. Thus the sequence of nucleotides on 
one chain determines the sequence on*he opposite chain, and we can represent 
part or all of a DNA molecule with a sequence of the letters A, C, G, and T, 
corresponding to the sequence of nucleotides found in one of the two chains. 



FIGURE 5.6: Diagram of a DNA molecule. 

One of the most important problems in DNA research is how to compare 
distinct DNA sequences. How different is one sequence of DNA from another? 
In some sense, this is a measure of the evolutionary distance between the 
two sequences (and, by extension, between the organisms from which they 
were derived). When a species splits into two new species, resulting in a 
fork in the evolutionary tree, the species’ initially identical DNA sequences 
begin to accumulate unique changes. Measuring the distance between the two 
sequences as a function of these differences provides insight into the nature of 
the evolutionary history of each species. 

During the course of evolution, DNA sequence differences arise in a variety 
of ways. One of the most common is nucleotide substitution, the apparent 
replacement of a letter in one DNA sequence relative to the original sequence. 
If this is the only type of change that has occurred between two sequences, 
then the Hamming distance provides a useful measure of the distance between 
them (by counting the total number of replacements). Another commonly 
occurring change in DNA is the insertion or deletion of nucleotides, realized 
as the insertion or deletion of letters in the corresponding DNA sequence. In 
this case, all of the subsequent letters in the altered sequence appear offset 
relative to the original sequence. This results in a large Hamming distance 
between the two DNA sequences, when in reality they are quite similar. To 
deal with this problem, we introduce another metric that is useful in making 
comparisons. 
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Let x and y be two sequences of the letters A, C, G, and T. We measure 
the distance between x and y by determining how many operations on x are 
necessary to turn it into y. We allow three types of operations on jc: We can 
insert any letter into x, we can delete any letter in x, and we can replace any 
letter with a different letter. For our particular x and y, we can use a sequence S 
of these operations to turn jc into y. Let is represent the number of insertions in 
the sequence, ds the number of deletions, and rs the number of replacements. 
So the total number of operations to turn x into y is is + ds + rs . But of 
course, there are many different choices of sequences of operations to turn x 
into y, and therefore we define the distance between x and y as follows: 

DEFINITION 5.11. The Levenshtein distance between sequences x and y is 
given by 


Dl(x, y) = mjn [i s +d s + r 5 } 

where the minimum is taken over all sequences S that turn x into y. 


EXAMPLE 5.7. Let jc = AGTTCGAATCC and y = AGCTCAGGAATC. 
Then we can get from jc to y by the following process: 

x : AGTTCGAATCC 

Replace T : AGCTCGAATCC 
Insert A : AGCTCAGAATCC 
Insert G : AGCTCAGGAATCC 
Delete C : AGCTCAGGAATC 

We can check, by examining all possibilities with three or fewer operations, 
that the fewest number of insertions, deletions, and replacements to get us 
from x = AGTTCGAATCC to y = AGCTCAGGAATC is four, as seen here. 
Therefore Di( jc, y) = 4. 


EXAMPLE 5.8. Let x = ACGTTGAATAC and y = AGGGTTGAATA. Visu¬ 
ally inspecting jc and y, we see that they appear somewhat similar. In fact, they 
have the segments GTTGAATA in common. It is not difficult to determine 
that the Levenshtein distance between jc and y is 3. In contrast, if we compute 
the Hamming distance between jc and y (counting the number of entries where 
they differ), we obtain 7. Thus, in this example we see that in comparison with 
the Hamming distance, the Levenshtein distance better reflects the proximity 
of jc and y resulting from their similar structure. 


The Levenshtein distance is also used for spell checking, speech recogni¬ 
tion, and plagiarism detection. It is just one of a variety of metrics researchers 
are using for DNA sequence comparison and analysis. 
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Exercises for Section 5.2 

5.16. Show that the Hamming distance is a metric on V n , the set of all words of 
length n. 

5.17. Can you extend the Hamming distance to be a metric on words of any length? 
That is to say, is there a way to extend the definition to obtain a metric that 
allows a distance between two words of differing length but that yields the 
same distance as the Hamming distance when applied to two words of the same 
length? 

5.18. Compare the Hamming metric and the Levenshtein metric on the set V". Is 
either metric, in general, always less than or equal to the other? Prove your 
assertion. 

5.19. We say that a code of length p corrects n errors if every word in V p is within 
distance n of a unique codeword in the code. 

(a) In V s , find an example of a code C that has four codewords and corrects 
two errors. 

(b) Show that four codewords is the maximal size for a code in V 8 that corrects 
two errors. 

5.20. Let W be the set of sequences of finite length made up of the letters A, C, G, 
and T. Show that the Levenshtein distance is a metric on W. 

5.21. For each of the following, find the Levenshtein distance between the two se¬ 
quences, x and y. In cases where the sequences have equal length, also compute 
the Hamming distance. 

(a) jc = ACGGTAT and y = GGTAG 

(b) x = CTGGTAC and y = CTAGATC 

(c) x = CCAGTCA and y = CCGTCTTA 

(d) x = TGACCGTTA and y = TGCGCTTAG 

5.22. A word was badly misspelled as TUPOTAGRY. Suppose that a spell checker 
changes the word to a word in its database that is the least Levenshtein distance 
away from the misspelling. Which of the following would the spell checker 
pick as the intended word: TOPOGRAPHY, TOPOLOGY, or TAUTOLOGY? 

5.3 Properties of Metric Spaces 

Metric spaces have many useful properties, one of which is presented in the 
following theorem: 

THEOREM 5.12. Every metric space is Hausdorff. 

Proof. Let ( X , d) be a metric space. Suppose x and y are distinct points 
in X with d(x, y) = e. Consider the sets U = B c j(x, e/2) and V = 
Bd(y, e/2). It follows that x e U, y e V, and U and V are open sets. 
We claim that U and V are disjoint. Suppose U DV ^ 0, and z is in the 
intersection. Then d(x, z) < e/2 and d{y, z) < e/2. Therefore, by the 
triangle inequality, 

d{x, y) < d{x, z) + d(z, y) < e/2 + e/2 = e; 

- that is, d(x, y) < e. This contradicts d(x, y) = e. Thus U IT V = 
0. Hence, there exist disjoint open sets U and V containing x and y, 
respectively, implying that X is Hausdorff. B 
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Theorem 5.12 implies that if a topological space is not Hausdorff, then it 
cannot be induced by a metric. For example, the topological space R/ c , the real 
numbers with the finite complement topology, is not Hausdorff and therefore 
cannot be induced by a metric on R. Also, the digital line is not Hausdorff 
and therefore cannot be induced by a metric on Z. We discuss this idea of a 
topological space being induced by a metric (being “metrizable”) further in 
Section 5.4. 

The s — 8 definition of continuity for functions / : R -> R generalizes 
to functions / : X -*■ Y if X and Y are metric spaces. The next theorem 
establishes that the resulting e — 8 definition is equivalent to the topological 
open set definition. 

THEOREM 5.13. Let ( X , dx) and ( Y , dy) be metric spaces. A function 
f : X -> Y is continuous in the open set definition if and only if for each 
x e X and e > 0, there exists a 8 > 0 such that if x' e X and dx(x, x') < 8, 
then dy(f (x), f{x')) < e. 

Proof. See Exercise 5.24. ■ 

In a metric space, not only can we measure distance between points, but 
we can also measure distance between sets, as follows: 

DEFINITION 5.14. Let (X , d) be a metric space. For sets A, B c X define 
the distance between A and B by 

d(A, B) = glb{d(a, b)\a e A, b e B}. 

The greatest lower bound in Definition 5.14 exists for every pair of sets A 
and B since the set of values {d(a, b) \ a € A, b € B) is bounded below by 0. 


EXAMPLE 5.9. Unlike the situation with points, if the distance between two 
sets is 0, the sets need not be equal. For example, in the standard metric on R 2 , 
if we let A be the x-axis and B be the y-axis, then d(A, B) = 0 but A ^ B. 
In fact, d(A, B) = 0 does not necessarily imply that A and B have a point in 
common. In R 2 again, with the standard metric, if we let A be the x-axis and 
B be the open ball of radius 1 centered at the point (0,1), then d(A, B) = 0 
but A n B = 0. (See Figure 5.7.) 



FIGURE 5.7: In both cases d(A, B) - 0. 
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We have already seen that a given set can have many different metrics. It 
will be useful to be able to compare the topologies they induce. The following 
theorem provides a means for doing so: 

THEOREM 5.15. Let d and d' be metrics on a set X, and let T and T, 
respectively, be the topologies that they induce. Then T' is finer than T if and 
only if for each x e X and e > 0, there exists a 8 > 0 such that Bjfx, 8) C 
Bd(x,e). (See Figure 5.8.) 



FIGURE 5.8: For each* e Xands > 0, there exists 8 such that B d r(x, 8) c B c i(x,e). 


Proof. Suppose that T' is finer than T. Then every open set in T is open 
in T'. In particular, for every x e X and e > 0, Bd(x, e ) is open in T 
and hence is open in T'. Since Bd(x, e ) is open in T' and contains x, 
Theorem 5.6 implies that there is a<$ > 0 such that B d <(x, 8) C B ( i(x, e), 
as we wished to show. 

Suppose now that for each x e X and e > 0, there exists a 5 > 0 
such that Bdfx, 5) c Bd(x, e). We prove that T' is finer than T. Let U 
be an open set in T. We show that U is open in T' . Let x be an arbitrary 
point in U. Since U is open in T, Theorem 5.6 implies that there is an 
e > 0 such that Bd(x, e) c U. By assumption there is a 8 > 0 such that 
Bd'(x,8 ) c Bd(x,e) c U. It follows that for each x e U there exists 
8 > 0 such that Bdfx, 8) c U. Theorem 5.6 implies that U is open in 
T', as we wished to show. ■ 

Using Theorem 5.15, we can easily prove that the standard metric, the 
taxicab metric, and the max metric all induce the same topology on K 2 . In 
the next theorem we prove that the first two of these metrics induce the same 
topology on M 2 . In Exercise 5.26 we ask you to show that the topologies 
induced by the taxicab metric and the max metric are the same. 

As we indicated in Section 1.2, the standard topology on the plane is not 
dependent on the particular shape of the basis elements; the round, open balls 
of the standard metric, the open diamonds of the taxicab metric, and the open 
squares of the max metric all generate the standard topology on the plane. 

THEOREM 5.16. The standard metric and the taxicab metric induce the 
same topology on M 2 . 
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Proof. We begin by showing that the topology induced by the standard 
metric d is finer than the topology induced by the taxicab metric dr- 
Let p e R 2 and e > 0 be arbitrary. Set <5 equal to e/2. We claim that 
Bdip, 8) c Bd T (p, e). Given the claim, it then follows by Theorem 5.15 
that the topology induced by d is finer than the topology induced by d.j. 

To prove the claim, let q e Bdip, 8) be arbitrary. Note that 

\pi -q\\ = \](pi -qi) 2 < 7 ip\ - q\) 2 + ipi - qi) 2 - dip, q). 

Since dip, q) < 8 = e/2, it follows that \p\ — q\\ < e/2. Similarly, 
\P 2 ~ qi\ < e/2. Thus, d T (p,q) = \pi - qi\ + \p 2 ~ qi\ < e, and 
therefore q e B C } T (p, e). Hence, Bd(p, 8) c Bd T (p, e), as we wished 
to show. 

Next, we show that the topology induced by dr is finer than the 
topology induced by d. Let p e R 2 and e > 0 be arbitrary. Here we set 
8 = e/V2. We claim that Bd T (p , 8) c Bd(p , e), implying the desired 
result by Theorem 5.15. 

Thus let q € Bd T (p, 8) be arbitrary. Note that 

Ipi -q\\ < \p\ -q\\ + \P 2 -q 2 \ < 8. 

Similarly, \p 2 — q 2 \ <8. Therefore, 

d{p, q) = yj(pi - q\) 2 + (P2 ~ qi) 2 < y/s 2 + 8 2 = 8\/2 = e. 
Hence, q € Bdip, «), implying that Bd r ip, 5) C Bdip, e) as claimed. 


Of course, not all metrics on the plane induce the standard topology on the 
plane. The function dy defined by 


dyip,q) 


1 if pi 7^ q\ or \P2 ~ q2\ > 1 . 

\P 2 -q 2 \ if Pi = q\ and |p 2 — 92I < 1. 


is a metric that induces a topology on the plane that is strictly finer than the 
standard topology. (See Exercises 5.3 and 5.27.) 

We next show how we can dramatically alter a metric and still leave the 
topology it induces unchanged. 


DEFINITION 5.17. Let (X, d) be a metric space. A subset A of X is said 
to be bounded under d if there exists a p > 0 such that dix,y) < p for all 
x, y e A. If X itself is bounded under d, then we say that d is a bounded 
metric. 

Notice that with a bounded metric on a set X, every subset of X is itself 
bounded. 

Amazingly enough, whether or not a metric is bounded does not have 
implications for the topology that it induces. The next theorem shows that 
every topology induced by a metric is induced by a bounded metric. 
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THEOREM 5.18. Let (X , d) be a metric space, and define d' : X x X —> R 
by d'(x, y ) = min{d(x, y), 1}. Then d' is a bounded metric that induces the 
same topology as d. 

Proof. Since d' cannot have a value greater than 1, it immediately follows 
that d' is bounded. But we do need to check that d' is a metric. Clearly, 
d'(x, y) > 0 for all x and y in X. We also see that d'(x, y) = 0 if and 
only if d(x, y) = 0, which happens exactly when x = y. In addition, 
d'(x, y) = d'(y, x), since the same holds true for d. Hence, all we have 
left to show is the triangle inequality. 

To prove the triangle inequality, we consider two cases for x, y, z e 
X. First, suppose that d(x,y ) > 1 or d(y,z) > 1. Then, it fol¬ 
lows that d'(x,y) + d'(y,z ) > 1. But d'(x,z) < 1, so we have 
d'(x, y) + d'(y, z) > d'(x, z). On the other hand, suppose d(x, y) < 1 
and d(y,z) < 1. Then, 

d\x, y) + d'(y, z) = d(x, y) + d(y, z) > d(x, z) > d'(x, z). 

Thus, d' satisfies the triangle inequality. 

We use Theorem 5.15 to show that the topologies T and T', induced 
by d and d’, respectively, are the same. First, we show that T is finer 
that T'. Let x e X and e > 0 be arbitrary. We consider two cases 
fore. First, if e < 1, then Bd(x,e) = B^{x, e). However, if e > 1, 
then B d i (x , e) = X, and therefore B d (x, e) C B d * (x, e) clearly holds. 
In either case, if we let 8 = e, then we have B d (x, 8) c B d fx, e). It 
follows that T is finer than T'. 

Next we show that T is finer than T. Again take an arbitrary x e X 
and e > 0. As above, if e < 1, then B d (x, e) = B d fx, e). If e > 1, then 
B d /(x, 1) C B d (x, e). Thus, if we set 8 = min{e, 1}, then B d >(x, 8) C 
B d (x, e ) holds for all x e X and e > 0. Therefore T is finer than T. 

It now follows that the topologies induced by d and d' are the same. 


EXAMPLE 5.10. Ifwe replace the standard metric d(x, y) = |x — y| on 1R by 
the bounded metric d r (x, y) = min{l, \x — y|}, we again obtain the standard 
topology. But now our basis elements are open intervals of length at most 2 
and the set R itself. We obtain R as a basis element because B d ' (x , y) = R 
for all y > 1 and x e R. 


In Theorem 5.18 there is nothing significant about the value 1 as the cutoff 
for defining a bounded metric that induces the same topology as the metric 
d. We could cut d off at one-millionth and still obtain the same topology. In 
fact, Theorem 5.18 holds if we replace the bound of 1 with a bound given by 
an arbitrary s > 0. In some sense the topology induced by the metric only 
depends on what is really close together in the metric. 
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Next, we define a notion of equivalence for metrics. The idea is that two 
metric spaces are equivalent if there is a bijection between them that preserves 
distance. Specifically, we have the following definition: 


DEFINITION 5.19. Let ( X , dx) and (T, dy) be metric spaces. A bijective 
function f : X —>• Y is called an isometry ifdx(x, x') = dy(f(x), f {x')) for 
every pair of points x and x' in X. If f : X -> Y is an isometry, then we say 
that the metric spaces X and Y are isometric. 

In this definition it suffices to use a surjective function / in place of the 
bijective function /. If a function / preserves distance, then / is injective. 
(See Exercise 5.30.) Therefore / being surjective and distance preserving 
implies that / is bijective. We use bijective in the definition to emphasize the 
fact that an isometry must be both bijective and distance preserving. 

Just as homeomorphism is the fundamental equivalence between topolog¬ 
ical spaces, isometry is the fundamental equivalence between metric spaces. 
All of the properties of a given metric space are preserved by isometry. Two 
metric spaces that are isometric are indistinguishable except for the particular 
names on the elements. 

Isometry is a stronger form of equivalence than homeomorphism. In par¬ 
ticular, isometric spaces are homeomorphic (see Exercise 5.31) but not vice 
versa, as the following example indicates: 


EXAMPLE 5.11 . The plane with the taxicab metric dy is not isometric to the 
plane with the standard metric d, even though these two metrics induce the same 
topology. To see this, suppose that we have an isometry f : R 2 -*■ M 2 such 
that d(f(p), f(q)) = drip, q) for all p,q e R 2 . Consider the four points 
a = (0, 0), ft — (1, 0), y = (0. 1) and S = (1, 1). In the taxicab metric, 
dr (at, P) = 1, dr(a, y) = \,dj(f,8) = \,dy(y,8) = \, dj(a, 8) = 2, 
and dy(f, y) = 2, as illustrated in Figure 5.9. Since / is an isometry, in the 
standard metric we must have d(f(a), /(/?)) = 1, d(f(a), f(y)) = 1, and 
d(f (ft), f(y)) = 2. However, in the standard metric this can only occur if 
f(P), f(y), and f(a) lie in a line with f(a ) equidistant from f(f) and f(y). 
Similarly, f(f), f(y), and /(<$) must lie in a line with /(<$) equidistant from 
f(ft) and f(y). Hence,/(a) = /(<$). But this contradicts / being a bijection, 
and therefore no such isometry exists. 




' 




( 0 . 0 ) 


FIGURE 5.9: Distances between points in the taxicab metric. 


j 
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Exercises for Section 5.3 

5.23. Let ( X , d) be a metric space. Let A and B be disjoint subsets of X that are 
closed in the topology induced by d. Prove that there exist disjoint open sets U 
and V such that A c U and B c V. 

5.24. Prove Theorem 5.13: Let (X, dx) and (Y, dy) be metric spaces. A function 
f : X -*■ Y is continuous in the open set definition if and only if for each 
x e X and s > 0, there exists a 5 > 0 such that if x' e X and dx(x, x') < 5, 
then dy(f(x), f(x')) < e. (Hint: Consider Exercise 4.3 and the proof of 
Theorem 4.6.) 

5.25. Let (X, d) be a metric space, and assume p e X and Acl 

(a) Provide an example showing that d([p). A) = 0 need not imply that 
p e A. 

(b) Prove that if A is closed and d({p], A) = 0, then p e A. 

5.26. Use Theorem 5.15 to prove that the taxicab metric and the max metric induce 
the same topology on K 2 . 

5.27. Consider the metric dy on IK 2 defined by 

d v (pq)= 1 if pi ^q\ or \p 2 - q 2 \ > 1, 

' ' V \P 2 ~qi\ >f Pi =qi and |p 2 - 92 I < 1. 


(a) Use Theorem 5.15 to prove that the topology induced by dy is finer than 
the standard topology. 

(b) Show that the standard topology is not finer than the topology induced by 
dv- 

5.28. Let (X, d) be a metric space. The function 


D(x,y ) 


d(x,y) 

1 +d(x, y) 


is a bounded metric on X. (See Exercise 5.10.) Show that the topologies 
induced by D and d are the same. 

5.29. On the set of continuous functions C[a, b] consider the metrics pm and p 
defined by 


PMif , 8 ) = raax xe [a,b]{\f(x) - g(*)|}, and 

P(/> 8) = [ I/to -g(*)l dx. 

Ja 

These metrics were introduced in Exercise 5.8 and Example 5.5, respectively. 

(a) Use Theorem 5.15 to prove that the topology induced by pu on C[a, b] is 
finer than the topology induced by p. 

(b) Show that for every ci,C 2 > 0 there exists / e C[a,b ] such that 
max* e[a ,fc]{|/to|} = ci and 

l/tol dx = c 2 . 
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(c) Let Z € C[a, b ] be the function defined by Z(jc) = 0 for all x e [a, b]. 
Given s > 0, show that no S > 0 exists such that B P (Z, S ) c B Pm (Z, s). 
(Hint: Part (b) helps.) 

(d) What does Theorem 5.15 allow us to conclude from (c)? 

5.30. Let ( X , dx) and (Y, dy) be metric spaces. Show that if / : X -»• Y is such that 
dx(x, x') = dy(f(x), f(x')) for all x, x' s X, then / is injective. 

5.31. Let (X, dx) and ( Y,dy ) be metric spaces and f : X —*■ Y be an isometry 
between them. Show that / is a homeomorphism between the corresponding 
metric spaces. 

5.32. Show that the max metric on M 2 , given by 

dM(p,q)= max{|pi — q\\, |p 2 — <? 2 1}» 
is not isometric to the standard metric on R 2 . 


5.4 Metrizability 

There are many topological spaces that are not induced by a metric. For 
instance, Theorem 5.12 implies that a topology that is not Hausdorff cannot 
be induced by a metric. But when a topology is induced by a metric, we 
have some immediate additional structure that can be very useful. Thus, given 
a topological space, we are interested in knowing if it can be induced by a 
metric. 

DEFINITION 5.20. Let X be a topological space. We say X is metrizable if 
there exists a metric d on X that induces the topology on X. 


EXAMPLE 5.12. In Example 3.4 we introduced the standard topology on the 
circle 5* as the subspace topology inherited from the standard topology on the 
plane R 2 . Is this topology on S 1 metrizable? Yes! First, note that we obtain a 
basis for the subspace topology on S 1 by intersecting open balls in the plane 
with S 1 . The resulting sets are open intervals along the circle. 

Consider the metric on S 1 defined by setting d(p, q) equal to the minimum 
nonnegative angle (in radians) needed to rotate the circle so that points p and q 
coincide. (See Exercise 5.4.) In this metric, the resulting open balls are open 
intervals on the circle. Thus the basis of open balls for the topology on S 1 
induced by d is the same as the basis just described for the standard topology 
on S 1 . It follows that the standard topology on S 1 is metrizable. 


In the previous example we showed that the subspace S 1 of the metric 
space M 2 is itself metrizable. This idea holds in general. If X is a metric space 
and Y is a subset of X, then the subspace topology on Y is metrizable. (See 
Exercise 5.34.) 

Given a set X, the discrete topology on X is metrizable. (See Exer¬ 
cise 5.33.) In fact, if X is finite, then every metric on X induces the discrete 
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topology. (See Exercise 5.6.) Thus, on a finite set the discrete topology is the 
only metrizable topology. 

Metrizability is a topological property, as the following theorem indicates: 

THEOREM 5.21. IfX is a metrizable topological space and Y is homeomor- 
phic to X, then Y is metrizable. 

Proof. See Exercise 5.35. ■ 

It turns out that many topological spaces are metrizable, and mathemati¬ 
cians have long been interested in the question, “Can we determine if a topo¬ 
logical space is metrizable without explicitly finding a metric for it; that is, are 
there conditions on the space that ensure that it is metrizable?” The answer is 
yes. The famous Urysohn Metrization Theorem asserts that if two simple con¬ 
ditions are satisfied (the space is “regular” and has a countable basis), we are 
guaranteed that the space is metrizable. We discuss this result in the remainder 
of this section. 

The topological property of regularity is a strengthening of the Hausdorff 
property. Recall that a topological space X is Hausdorff if for every pair of 
distinct points in X, there exists a pair of disjoint open sets, each containing 
one of the points. 

DEFINITION 5.22. Let X be a topological space. We say that X is regular if 

(i) One-point sets are closed in X; 

(ii) For every a e X and every closed set B in X that does not contain 
a, there exist disjoint open sets U and V such that a € U and 
B cV. (See Figure 5.10.) 



FIGURE 5.10: Separating a point and a closed set in a regular space. 


EXAMPLE 5.13. The real line R with the standard topology is regular. One- 
point sets are closed in R. Choose a point x e R and a closed set C that 
does not contain x. The complement of C is an open set containing x, and 
therefore it must contain an open interval (a, b ) that contains x. We have 
a < x < b, and we can choose c and d so that a<c<x<d<b. Then 
U = (—oo, c) U (d, oo) is an open set containing C, and V = (c, d) is an open 
set containing x. Furthermore, U and V are disjoint, as required. 
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By combining conditions (i) and (ii) in the definition of regularity, it is 
straightforward to prove that if a topological space is regular, then it is Haus- 
dorff. However, without the requirement that one-point sets be closed, this 
need not be the case; specifically, it is possible to have a topological space that 
satisfies only condition (ii) in Definition 5.22 and that is not Hausdorff. (See 
Exercise 5.42.) 

While being regular implies being Hausdorff, the converse need not hold. 
In the next example we introduce a topological space that is Hausdorff but not 
regular. Thus regularity is a stronger property. 


EXAMPLE 5.14. On R, consider the collection consisting of all open inter¬ 
vals (a, b) and all subsets ( c , d) D Q. This is a basis for a topology on R, and the 
resulting topological space is Hausdorff but not regular. (See Exercise 5.37.) 


The properties of being Hausdorff and being regular are known as separa¬ 
tion axioms. There is another separation axiom worth mentioning, although 
it does not play a role in the Urysohn Metrization Theorem. 

DEFINITION 5.23. Let X be a topological space. We say that X is normal if 

(i) One-point sets are closed in X; 

(ii) For every pair of disjoint closed sets A and B in X, there exist 
disjoint open sets U and V such that A c U and B C V. (See 
Figure 5.11.) 



FIGURE 5.11: Separating two closed sets in a normal space. 

Not only is R regular in the standard topology, but it is normal as well. In 
fact, every metric space is normal. (See Exercise 5.23.) Furthermore, it is an 
easy consequence of the definitions that every normal space is regular. 

The second assumption in the Urysohn Metrization Theorem is that the 
topological space has a countable basis. Recall that a set is countable if it is 
either finite or can be put in a one-to-one correspondence with the positive 
integers. 
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EXAMPLE 5.15. The real line R with the standard topology has a countable 
basis. Consider the collection C — {( a,b ) | a,b e Q, a < b}. It is a 
countable collection of intervals since it is indexed by Q x Q, a countable set. 
(See Theorem 0.29.) Using Theorem 1.13, it is straightforward to show that C 
is a basis for the standard topology on R. 

Similarly, the standard topology on R" has a countable basis given by the 
collection of all products of open intervals having rational endpoints. 


Pavel Urysohn (1898-1924) was one of the most promising mathemati¬ 
cians of his era. Unfortunately, he drowned at the age of 26 while swimming in 
rough seas off the coast of France. It was in 1924 that he proved the following 
theorem that now bears his name: 

THEOREM 5.24. The Urysohn Metrization Theorem. If a topological 
space X is regular and has a countable basis, then X is metrizable. 

We do not prove the Urysohn Metrization Theorem since its proof requires 
tools that we have not developed. A proof of the Urysohn Metrization Theorem 
can be found in [Mun]. The idea behind the proof, however, is straightforward. 
Using the assumptions that X is regular and has a countable basis, it can be 
shown that X can be embedded in a metric space. Therefore X is homeomor- 
phic to a subspace of a metric space. Since a subspace of a metric space is 
metrizable, and since metrizability is a topological property, it follows that X 
is metrizable. 


EXAMPLE 5.16. The arithmetic progression topology on Z is defined via 
the basis consisting of all arithmetic progressions 

Aa.ib = {•••- a — 2b, a — b, a, a + b, a + 2b ,...}, 

for a, b € Z with b ^ 0. (See Exercise 1.15.) This basis is countable since it 
is a collection indexed by a subset of Z x Z, a countable set. 

Furthermore, the arithmetic progression topology on Z is regular. (See 
Exercise 5.39.) 

Since this is a regular topological space with a countable basis, it is metriz¬ 
able. There must be a metric that induces this topology. In fact, we can find it. 
The metric on Z given by 

d ( x 3,) - ° ifx = y> 

min{ ^ | n\ divides |x — y|} if x ^ y, 

induces the arithmetic progression topology on Z. (See Exercises 5.11 and 5.40.) 


The Urysohn Metrization Theorem indicates that every regular topological 
space with a countable basis is metrizable. Does the converse hold? In other 
words, must a metrizable space have a countable basis and be regular? The 
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answer to half of this question is yes. As we indicated above, a metric space 
is normal, and a normal space is regular; therefore a metric space is regular. 
However, the following example demonstrates that the answer to the other half 
of the question is no. A metric space does not necessarily have a countable 
basis. 


EXAMPLE 5.17. The real line R with the discrete topology is metrizable. 
(See Exercise 5.33.) Since this topology is discrete, every one-point set is an 
open set. There are uncountably many such sets since R is uncountable. 

Now, given a basis for a topology, every open set in the topology must be 
a union of basis elements. It follows that every basis for the discrete topology 
must include each one-point set as a basis element. Therefore every basis for 
the discrete topology on R must contain the uncountable collection of one- 
point sets. Thus, although R with the discrete topology is a metrizable space, 
it does not have a countable basis. 


Exercises for Section 5.4 


5.33. Let X be a set. 

(a) Show that the discrete topology on X is induced by the metric 


d(x, y) 


0 ifx = y, 
1 if x # y. 


(b) Is the trivial topology on X metrizable? 

5.34. Let (X, d) be a metric space, and let Y be a subset of X. Consider the function 
dy : Y x Y -> R, given by dy(x, y) = d(x, y). 

(a) Prove that dy is a metric on Y. 

(b) Prove that the topology induced by dy on Y is the subspace topology that 
Y inherits from the metric topology on X. 

5.35. Let (X, d) be a metric space. Assume that / : X -> Y is a homeomoiphism, 
anddefined* : Y x Y -> Rby d*(x,y) = d(f~ l (x), f~ l (y)). (Theorem5.21 
follows from exercises (a) and (b).) 

(a) Prove that d* is a metric on Y. 

(b) Prove that the topology on Y is induced by the metric d*. 

(c) Prove that / is an isometry between (X, d) and (L, d*). 

5.36. Prove that R with the lower limit topology is a regular topological space. 

5.37. On R, consider the collection B consisting of all open intervals (a, b) and all 
subsets (c, d) fl <Q>. 

(a) Show that B is a basis for a topology on R. 

(b) Show that R is Hausdorff in this topology. 

(c) Show that R is not regular in this topology. 

5.38. Show that if X and Y are regular, then so is the product space X xY. Conclude 
that R" is regular. 

5.39. Show that the arithmetic progression topology on Z is regular. 
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5.40. Show that the arithmetic progression topology on Z is induced by the metric 


d(x, y ) 


0 if x = y, 

min{ jjj | n! divides \x — y|} if x ^ y. 


5.41. Show that regularity and normality are topological properties. That is, show 
that if X is regular and Y is homeomorphic to X, then Y is regular (and do the 
same for normality). 

5.42. Show that the two-point space X — (*i, * 2 ) with the trivial topology satisfies 
the second condition in the definition of regularity; that is, show that for every 
point a in X and every closed set B in X not containing a, there exist disjoint 
open sets U and V such that a e U and B c V. (Note: Since one-point sets 
are not closed in this topological space, this is not a regular space. Nor is it 
Hausdorff. This demonstrates why in the definition of a regular topological 
space the second condition alone does not suffice if we wish to have a regular 
space be Hausdorff.) 



Chapter 6 
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There are several natural approaches that we could take to rigorously capture 
the concept of connectedness for a topological space. One approach might be 
to say that a topological space is connected if it cannot be broken down into 
two distinct pieces that are separated from each other. Another approach might 
be to say that a topological space is connected if we can take a continuous walk 
in the space from any point to any other point. In this chapter we define both 
of these types of connectedness. Spaces of the first type are called connected, 
while spaces of the second type are called path connected. We prove that these 
two types of connectedness are not the same, but that path connectedness does 
imply connectedness. 

As simple as the concept of connectedness appears, it has profound impli¬ 
cations for topology and its applications. We show that connectedness is the 
key concept behind the Intermediate Value Theorem. Also, we demonstrate 
how connectedness plays a role in applications in geographic information sys¬ 
tems, population modeling, and motion planning in robotics. Further, we use 
connectedness to help distinguish topological spaces. We begin in Section 6.1 
by defining connectedness and examining some aspects of it. In Section 6.2 
we show that Euclidean space is connected, and then we use connectedness- 
related properties to distinguish between some pairs of topological spaces. In 
Section 6.3 we prove the Intermediate Value Theorem and then use it to derive 
a few interesting mathematical and applied consequences. We define path con¬ 
nectedness in Section 6.4. There we show that being path connected implies 
being connected, and we introduce a space, the topologist’s whirlpool, that is 
connected but not path connected, establishing that connectedness and path 
connectedness are not equivalent properties. Finally, in Section 6.5 we explore 
an application of these concepts to motion planning for mobile robots. 

6.1 A First Approach to Connectedness 

We begin with a definition that addresses the first intuitive approach to con¬ 
nectedness described in the chapter introduction. 

DEFINITION 6.1. Let X be a topological space. 

(i) We call X connected if there does not exist a pair of disjoint 
nonempty open sets whose union is X. 

(ii) We call X disconnected if X is not connected. 

(iii) If X is disconnected, then a pair of disjoint nonempty open sets 
whose union is X is called a separation of X. 
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EXAMPLE 6.1. Consider the two topologies on the three-point set X = 
{<3, b, c} in Figure 6.1. In the first topology, X is connected since there is no 
pair of disjoint nonempty open sets whose union equals X. However, in the 
second topology, X is disconnected. The pair of open sets, U — {a, b] and 
V = {c}, is a separation of X. 

<OD> <d2S> 

a b c a b c 

FIGURE 6.1: Two topologies on X = [a, b, c}, one connected, one not. 


EXAMPLE 6.2. The subspace X = (—1,0) U (0, 1) of R is disconnected. 
The pair of sets, (—1,0) and (0, 1), is a separation of X. (See Figure 6.2.) 


f- 

-1 




o 


l 


FIGURE 6.2: X = (-1,0) U (0,1) is disconnected. 

_i 


EXAMPLE 6.3. If a set X consists of more than one point, and it has the 
discrete topology, then it is disconnected. If A is any nonempty proper subset 
of X, then the pair of sets, A and X — A, is a separation of X. 

_i 


EXAMPLE 6.4. A set X having the trivial topology is a connected topological 
space. There is no separation of X since there are no nonempty proper subsets 
of X that are open. 


EXAMPLE 6.5. If p e R, then R — {p} is a disconnected topological space 
since the pair of sets, U = (—oo, p) and V = (p, oo), is a separation of 
R — {p}. (See Figure 6.3.) 




* 

P 




FIGURE 6.3: R - {p} is disconnected. 
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The following result provides an alternative formulation of connectedness: 

THEOREM 6.2. A topological space X is connected if and only if there are 
no nonempty proper subsets of X that are both open and closed in X. 

Proof. See Exercise 6.2. ■ 

In every topological space X, the sets X and 0 are both open and closed. 
Thus, Theorem 6.2 indicates that X is connected if and only if those are the 
only sets that are both open and closed in X. 


-1 

EXAMPLE 6.6. In R with the lower limit topblogy, intervals [a, b) are both 
open and closed. Therefore Theorem 6.2 implies that R is disconnected in this 
topology. 

_i 

Our definition of connectedness applies to topological spaces. But we can 
extend it in a natural way to subsets of a topological space. 

DEFINITION 6.3. A set A contained in a topological space X is said to be 
connected in X if A is connected in the subspace topology. If A is not connected 
in X, we say it is disconnected in X. 


EXAMPLE 6.7. The subspace of R given by A = [—1,0) U (0, 1] is discon¬ 
nected. The sets U = [—1, 0) and V = (0, 1] form a separation of A. Hence 
A is disconnected in R. 


The next theorem yields an alternate characterization of disconnected sets 
in a topological space X. 

THEOREM 6.4. A set A is disconnected in X if and only if there exist open 
sets U and V in X such that A C U U V, U H A ^ 0, V fl A ^ 0, and 

u n v n A = 0 . 

Proof. Suppose that A is disconnected in X. Then there exist nonempty 
sets P and Q that are open in A, disjoint, and such that P U Q — A. 
Since P and Q are open in A there exist sets U and V that are open in 
X and such that U fl A = P and V n A = Q. Clearly, A C U U V, 
U (1 A ^ 0, V DA ^ 0, and Lm V n A = 0. 

Now suppose that U and V are open sets in X such that A c U U V, 
I/nA#0, VnA^0, and U D V D A = 0. If we let P = U fl A and 
Q = V n A, then it follows that the pair of sets, P and Q, is a separation 
of A in the subspace topology, and therefore A is disconnected in X. ■ 
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Sets such as U and V in Theorem 6.4 provide us with another notion of 
separation, defined as follows: 

DEFINITION 6.5. Let Abe a subspace of a topological space X. If U and 
V are open sets in X such that AcUUV,Uf)Af:0,Vr\A^0, and 
U D V fl A = 0, then we say that the pair of sets, U and V, is a separation 
of A in X. (See Figure 6.4.) 



FIGURE 6.4: The sets U and V form a separation of A in X. 

IMPORTANT NOTE: For a pair of sets U and V to be a separation of A in 
X, we do not require that V fl V is empty; we only need U fl V to be disjoint 
from A. 


EXAMPLE 6.8. Let A be the subset of the plane consisting of the two curves 
y = e x and y = 0, as illustrated in Figure 6.5. Is A connected? No. We can 
form a separation of A in the plane by letting U and V be the sets of points on 
either side of the graph of y = e x ~ l , shown as the dashed curve in the figure. 
Thus A is disconnected in the plane. 

Another separation of A in the plane is given by letting U' be the set of 
points below the curve y — e x and V' be the set of points above the x-axis. 
Note that U' and V’ are not disjoint, but they still form a separation of A in the 
plane since their intersection is disjoint from A. 



FIGURE 6.5: The set A is disconnected. 
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It is apparent that connectedness is a topological property since it is de¬ 
fined only in terms of open sets. It is straightforward to show that if X and Y 
are homeomorphic, then X is connected if and only if Y is connected. How¬ 
ever, as the following theorem indicates, we do not need the full strength of a 
homeomorphism to preserve connectedness: 

THEOREM 6.6. IfX is connected and f : X —» Y is continuous, then f(X) 
is connected in Y. 

Proof. Suppose that f{X) is not connected in Y. Then there exist open 
sets U and V that form a separation of f(X) in Y. The function / is 
continuous, and therefore f~ l (U) and / -I (V) are open in X. Both 
U and V have nonempty intersection with f(X)\ thus f~ l (U ) and 
f~\V) are nonempty. Furthermore f(X) c U U V, implying that 
X c f~'(U) U f~\V). Finally, since U n V n f(X) = 0, it follows 
that f~ l (U ) and f~ 1 ( V ) are disjoint. Therefore the pair of sets, f~ l (U) 
and f~ l (V), is a separation of X, contradicting the assumption that X is 
connected. Hence, f(X) is connected in Y. ■ 

The following lemma will be useful to us in a few proofs in this chapter: 

LEMMA 6.7. Let C and D be subsets of a topological space X. Assume that 
C is connected and C C D. Further assume that U and V form a separation 
of D in X. Then either C C U or C C V. 

Proof. Suppose that neither C C U nor C C V. Then f/fiC ^ 0 
and V D C ^ 0. It follows that U and V form a separation of C in X, 
contradicting the assumption that C is connected. ■ 

Our next theorem indicates that if C is connected in X, and we add limit 
points to C, then the resulting set is also connected in X. 

THEOREM 6.8. Let C be connected in X, and assume that C C A c Cl{C). 
Then A is connected in X. 

Proof. Suppose that A is not connected in X, and let U and V form a 
separation of A in X. Then by Lemma 6.7, either C c U or C c V. We 
may assume, without loss of generality, that C CU. Hence C fl V = 0. 
But, since U and V form a separation of A in X, it follows that A fl V ^0. 
Pick x e A n V. Now, x € A and A c C1(C) imply x e C1(C). But 
x € V, an open set in X which is disjoint from C. So x cannot be in 
the closure of C, yielding a contradiction. Thus, it follows that A is 
connected in X. ■ 

A union of connected subsets of a topological space is not necessarily 
connected. For example, A = {0} and B = {1} are connected sets in R with 
the standard topology, but A U B is not connected in R. However, as the 
following theorem indicates, if the sets in a collection of connected subsets of 
a topological space have at least one point in common, then we are ensured 
that the union of the subsets is connected: 
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THEOREM 6.9. Let X be a topological space, and let {C a } aeA be a collec¬ 
tion of connected subsets of X such that HaeA f 0. Then (J aeA * s 
connected in X. 

Proof. Suppose that UaeA is not connected in X. Thus there exist 
sets U and V that form a separation of U<*eA C* in X- L et x b e in 
PlaeA C a - Then either x € U or x e V, but both cannot hold. We 
may assume, without loss of generality, that x lies in U and does not 
lie in V. Lemma 6.7 implies that for all a e A, either C a c U or 
C a c V. Since x e U and x V, it follows that C a C U for all a € A. 
Thus U«€A C U, contradicting the assumption that U and V form 
a separation of [J aeA C a in X. Therefore U«sa is connected in X. 


While Theorem 6.9 is about unions of connected spaces, we can use it 
to prove that a product of connected spaces is connected. We do that in the 
following theorem: 

THEOREM 6.10. Let X\,... , X n be connected spaces. Then the product 
space X\ x ... x X n is connected. 

Proof. We prove the result for a product of two spaces. The general result 
can then be shown by induction. Assume that X and Y are connected 
topological spaces. We prove that X x Y is connected. First, note that 
for every x G X, the subspace {x} x Y of X x Y is homeomorphic to 
Y and is therefore connected. Similarly, for every y e Y, the subspace 
X x {y} of X x Y is connected. Thus, by Theorem 6.9, for every x e X 
and y e Y the set ({*} x Y) U (X x {y}) is connected in X x Y. (See 
Figure 6.6.) 


Y 


y 


x x 

FIGURE 6.6: The set ({*} x Y) u (X x {y}) is connected in X x Y. 

Now fix xo 6 X and let y vary. Each set ({xo} x T) U (X x {y}) 
contains the set {xo} x T. It then follows by Theorem 6.9 that 

U(({xo)xf)U(Xx{y))) 

yeY 
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is connected in X x Y. Furthermore, 

|J((Uo}xF)U(Xx{y))) = Xxy, 

yeY 

implying that X x Y is connected. (See Figure 6.7.) 
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FIGURE 6.7: The product X x Y is the union, over y in Y, of the subspaces (bo) x 
Y) U (X x {y}). 


A topological space can naturally be partitioned into a collection consisting 
of its largest connected subsets. We formalize this idea in what follows. 

Let X be a topological space. Define a relation ~c on X by x ~c y if 
x and y lie in a connected subset of X. We claim that ~c is an equivalence 
relation. It is clear that x ~c x for every x e X and that x ~c y implies 
y ~c x for every x,y e X. Suppose x, y, and z in X are such that x ~c y 
and y ~c z. Then x and y lie in a connected set C in X, and y and z lie in a 
connected set C' in X. Both C and C' contain the point y. Therefore C U C' is 
connected by Theorem 6.9. Since x and z lie in C U C', it follows that x ~c z. 
Thus, for all x,y,z € X, if x ~c y and y ~c z, then x ~c z. Hence ~c is 
an equivalence relation. 

DEFINITION 6.11. Let X be a topological space and let ~c be the equiv¬ 
alence relation on X defined by x ~c y if x and y lie in a connected subset 
of X. The components of X are the equivalence classes of the equivalence 
relation ~c- 

Since the components of a topological space X are equivalence classes 
under an equivalence relation, they form a partition of X. That components 
are the largest connected subsets of X follows from the first two parts of the 
next theorem. 

THEOREM 6.12. Let X be a topological space. 

(i) Each component of X is connected in X. 
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(ii) If A is connected in X, then A is a subset of a component ofX. 
(Hi) Each component ofX is a closed subset of X. 

We prove (i) here. For proofs of (ii) and (iii), see Exercise 6.9. 

Proof of (i). Let X be a topological space and C be a component of 
X. We prove that C is connected. Pick a point p e C. For every 
x e C, x ~c p since C is an equivalence class under the equivalence 
relation ~c that determines the components. Therefore, by the definition 
of ~c, there exists a connected set C x containing x and p. 

We claim that C x c C. To prove the claim, suppose y e C x . Then y 
and p both lie in the connected set C x , implying that y ~c p- Therefore 
y is in the same equivalence class as p, namely C, and it follows that 

c x cc. 

Now, by the Union Lemma we have Ujt 6 c C* = C. Therefore C 
is the union of the connected sets C x , and each of these connected sets 
contains the point p. By Theorem 6.9, it follows that C is connected. 


-1 

EXAMPLE 6.9. Consider R with the finite complement topology. It is a 
connected topological space. (See Exercise 6.1.) Therefore, in this topology 
there is only one component, the whole space R. 


EXAMPLE 6.10. Consider the real line R with the lower limit topology. Here 
we have a situation that is very different from the one in Example 6.9. We claim 
that the components are the one-point subsets of R. To prove this, we show 
that if a subset of R consists of more than one point, then it is disconnected. 
Thus let A be a subset of R containing two points x < y. Pick z such that 
x < z < y. The pair of sets, (—oo, z ) and [z, oo), is a separation of A in R 
with the lower limit topology, and therefore A is disconnected. Thus, the only 
nonempty connected subsets of R in this topology are the one-point sets, and 
it follows that they are the components. 


DEFINITION 6.13. A topological space X is called totally disconnected if 
the components of X are the one-point subsets of X. 

By Example 6.10, R with the lower limit topology is totally disconnected. 
Another totally disconnected topological space is the set of rational numbers 
Q in the standard topology. (See Exercise 6.7.) 

The following theorem indicates that homeomorphisms map components 
to components: 

THEOREM 6.14. Let f : X -* Y be a homeomorphism. IfC is a component 
ofX, then f(C) is a component ofY. 
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Proof. Suppose that / : X -* Y is a homeomorphism and C is a compo¬ 
nent of X. Then C is connected in X by Theorem 6.12. Thus Theorem 6.6 
implies that /(C) is connected in Y . Again by Theorem 6.12, it follows 
that /(C) is a subset of a component D of Y. We claim that /(C) = D, 
to complete the proof of the theorem. 

The component D is connected in Y , and / -l is a continuous func¬ 
tion; therefore /“*(£>) is connected in X. Furthermore, /(C) c D 
implies that C C /”’(£>). But C is a component of X, and C is a 
subset of the connected set Therefore C = / _I (D). Hence, 

/(C) = D, as we wished to show. ■ 

Exercises for Section 6.1 

6.1. Prove that an infinite set with the finite complement topology is a connected 
topological space. 

6.2. Prove Theorem 6.2: A topological space X is connected if and only if there 
are no nonempty proper subsets of X that are both open and closed in X. 

6.3. Prove that a topological space X is connected if and only if every nonempty 
proper subset of X has a nonempty boundary. 

6.4. (a) Prove that for all integers n > 0 the set {— n, — n + 1,... , n — 1, n] is 

connected in the digital line. 

(b) Use part (a) to prove that the digital line is connected. 

6.5. In the topology generated by the basis B = {(-a, a) \ a € R}, show that R is 
connected. 

6.6. Let X be a set and assume p e X. Prove that every subset of X is connected in 
the particular point topology PPX p on X and in the excluded point topology 
EPX p on X. (See Exercises 1.7 and 1.8.) 

6.7. (a) Prove that if a topological space X has the discrete topology, then X is 

totally disconnected. 

(b) Let Q be the set of rational numbers with the standard topology. Prove 
that Q is totally disconnected. (This exercise and Example 6.9 demonstrate 
that the converse to the result in part (a) does not hold. In both cases, the 
space is totally disconnected but does not have the discrete topology.) 

6.8. Let X be a topological space and {A; }yez + be a collection of connected subsets 
of X such that A; f1 A;+i ^ 0 for each j > 1. Prove that Uy e z + ' s 
connected in X. 

6.9. Prove Theorem 6.12, parts (ii) and (iii): Let X be a topological space. 

(a) If A is connected in X, then A is a subset of a component of X. 

(b) Each component of X is a closed subset of X. 

(c) Provide an example showing that the components of X are not necessarily 
open subsets of X. 

6.10. The following examples demonstrate that the condition U C\ V C\ A = 0 is 
appropriate in the definition of a separation of A in X and that the condition 
would be too strong if it required U n V = 0: 

(a) Find an example of a topology on X = {a, b, c } and a disconnected subset 
A such that every pair of sets, U and V, that is a separation of A in X 
satisfies U fl V n (X — A) # 0. 

(b) Find a topology on R and a disconnected subset A such that every pair of 
sets, U and V, that is a separation of A in R satisfies I/nfn(R-A)^0. 
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Supplementary Exercises: GIS Revisited 


In the following exercises, we continue the investigation of the intersection value and 
its application to geographic information systems, which we introduced in Section 2.4. 
Recall that in this application we are particularly interested in regularly closed planar 
sets, the interiors of which are not the union of two disjoint nonempty open sets. With 
the concept of connectedness available to us, we can now define these sets of interest 
in a more concise manner. Furthermore, we work here with a more general collection 
of sets than the planar spatial regions introduced in Section 2.4. In particular, we do 
not require the sets to be subsets of R 2 . 

DEFINITION 6.15. Given a topological space X, a spatial region A in X is a 
nonempty, regularly closed, proper subset of X that has a connected interior. 

SE 6.11. Let X be a topological space. Prove that if A is a proper subset of X, and 
A is the closure of a nonempty, connected open set, then A is a spatial region. 
(Note: The interior of A need not be the open set referred to in the statement of 
the problem, but it can be shown that the interior of A is connected as a result 
of the fact that A is the closure of a connected open set.) 

Recall the definition of the intersection value, given in Section 2.4. Since spatial regions 
are regularly closed sets. Corollary 2.19 applies, and it follows that the intersection val¬ 
ues (1,0, 1,0), (0,0, 1,0), (1,0,0,1), (0,0,0, 1), (1,0, 1, 1), and (0,0, 1, 1) are not 
possible for a pair of spatial regions. The following exercise eliminates one more pos¬ 
sible intersection value, as long as we assume that we have two spatial regions in a 
connected topological space. The plane is connected (see Corollary 6.18), and there¬ 
fore this exercise establishes the assertion in Theorem 2.21 that the only intersection 
values that are possible for a pair of planar spatial regions are (1, 1, 1,0), (0. 1, 1,0), 
(1, 1,0, 1), (0, 1,0, 1), (1,1,0,0),(0,0,0,0),(1,0,0,0),(0, 1, 1, l),and(l, 1, 1, 1). 

SE 6.12. Let X be a connected topological space, and assume that A and B are spatial 
regions in X. Then the intersection value for A and B in X cannot equal 
(0, 1,0.0). (Hint: Prove that A D B is a nonempty set that is both open and 
closed, and then apply Theorem 6.2.) 

Consider Theorem 2.21 again. We addressed parts of its the proof in Exercises 2.30 
and SE 6.12. The rest of Theorem 2.21 follows by showing that, if the intersection 
value for spatial regions A and B in a connected topological space X is as depicted 
in the left column in the following table, then the relationship between A and B is as 
depicted in the corresponding entry in the right column. We address these results in 
the subsequent exercises. 


Intersection Value 

Relationship 

(L L 1.0) 

AC B 

(0, 1. 1.0) 

A C Int(B) 

(1,1,0,1) 

B C A 

(0, 1,0, 1) 

B C Int(A) 

(1. 1,0,0)- 

A = B 
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The following lemma will be helpful: 

LEMMA 6.16. Let X be a connected topological space. Assume that C, D C X and 
that C is connected. If C D d D = 0, then either C C Int(D) or C Cl Cl(D) = 0. 

SE 6.13. Prove Lemma 6.16. 

The following three exercises address the first, second, and fifth rows in the foregoing 
table. Lemma 6.16 should provide some assistance. The proofs for the third and forth 
rows are similar to those for the first and second rows. 

SE 6.14. Let A and B be spatial regions in a connected topological space X. Prove 
that if the intersection value for A and B in X is (1,1,1,0), then A c B. 

SE 6.15. Let A and B be spatial regions in a connected topological space X. Prove 
that if the intersection value for A and B in X is (0,1,1,0), then A C Int (B). 

SE 6.16. Let A and B be spatial regions in a connected topological space X. Prove 
that if the intersection value for A and B in X is (1,1,0,0), then A — B. 


6.2 Distinguishing Topological Spaces 
via Connectedness 

In this section, we prove that Euclidean n-space in the standard topology is a 
connected topological space. We first show that the real line in the standard 
topology is connected. The general result for R" then follows by Theorem 6.10. 
We also show how the concept of connectedness can be used to help distin¬ 
guish between topological spaces (that is, determine that particular pairs of 
topological spaces are not homeomorphic). 

To prove that the real line is connected, we use the following properties of 
the real number system: 

(i) The Least Upper Bound Property: Every subset ofR that is bounded 
above has a least upper bound. 

(ii) If x, y e R with x < y then there exists z such that x < z < y. 


THEOREM 6.17. The real line R in the standard topology is a connected 
topological subspace. 

Proof. Suppose it is not, and let U and V form a separation of R. Pick 
u e U and v e V. We may assume u < v without loss of generality. Let 
U' = U fl [u, v] and V' = Vfl [u, u]. (See Figure 6.8.) It follows that 
f/'UV' = [u, uj. Since U' is bounded from above (by v ), U' has a least 
upper bound; call it c. We have u < c < v. We derive a contradiction 
by showing that c $ U' and c g V'. 

To show that c ft V\ we assume that c € V and derive a contradic¬ 
tion. Since u g V' and V' is open in [m, v], it follows that there exists 
d such that u < d < c and (d, c] C V. This implies that d is an upper 
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FIGURE 6.8: The sets U' and V' in [ u ,«]. 

bound of U' and that d is less than the least upper bound c. This is a 
contradiction and thus c g V'. 

Next we show that c $ U'. We do this by contradiction as well. 
Therefore, assume that c e U'. Since U' is open in [«, v\ and v g U', 
there exists d such that [c, d) c U'. For any e e ( c , d) it follows that 
e € U' and e > c, contradicting the fact that c is an upper bound of U'. 
Thus c £ U'. 

Therefore c & U r and c g V'. But c e [n, u] and U' U V' = [«, u]. 
With this final contradiction it now follows that R in the standard topology 
is a connected topological space. ■ 


Theorems 6.10 and 6.17 imply the following corollary: 

COROLLARY 6.1 8 . Euclidean n-space R" is a connected topological space. 


EXAMPLE 6.11 . All interval subsets of R are connected in R. Why? Open 
intervals (a, b), (—oo, b), and (a, oo) are connected because they are homeo- 
morphic to R. Theorem 6.8 then implies that all other interval subsets of R are 
connected because they can be obtained from open intervals by adding limit 
points. 


EXAMPLE 6.12. The n-sphere S n is a connected topological space. We 
prove this for the sphere S 2 , but a similar approach works to prove that 5" is 
connected. 

The stereographic projection function introduced in Example 4.16 is a 
homeomorphism between R 2 and the subspace S 2 — {A} of S 2 that is obtained 
by removing the north pole N. Since R 2 is connected, so is S 2 — {iV}. Thus by 
Theorem 6.8, the closure of S 2 — {IV} in S 2 is connected as well. That closure 
is, of course, the whole space S 2 , and therefore the sphere S 2 is a connected 
topological space. 


EXAMPLE 6.13. Since the interval / = [0,1] is connected, so is the square 
/ x I. The annulus, Mobius band, torus, Klein bottle, sphere, and projec¬ 
tive plane are all either obtained from a square by gluing pairs of edges or 
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homeomorphic to such a space. (See Figure 6.9.) Therefore there are quotient 
maps from the square onto each of these spaces, and these maps are continu¬ 
ous functions with a connected domain. It follows that each of these spaces is 
connected. 



FIGURE 6.9: Quotient spaces of the square are connected. 


EXAMPLE 6.14. The topological space R 2 — {O} is called the punctured 
plane (where O — (0, 0) is the origin in the plane). We show that it is con¬ 
nected. Each open half plane H+ = {(*, y) | y > 0} and H~ = {(x, y) \ y < 0} 
is homeomorphic to R 2 . (See Example 4.14.) Therefore H+ and H- are both 
connected in R 2 . By Theorem 6.8, we can add limit points to each and maintain 
the fact that they are connected. So to both H + and H- we add all of the x-axis 
except for the origin. The resulting sets are also connected in M 2 . Since the 
resulting sets intersect, their union is also connected, and their union is exactly 
R 2 - {6>}. 


Note that if p is a point in K 2 , then R 2 — {p} is connected since R 2 — {p} 
is homeomorphic to R 2 — {O}. 

Now that we have a collection of topological spaces that are connected, we 
demonstrate how connectedness can be used to distinguish between topological 
spaces. A line is not a circle is not a plane is not a sphere. How can we prove 
this? We will see that connectedness helps. 

As we indicated in Example 6.5, the removal of a point from the real line 
results in a disconnected topological space. This is an important enough notion 
that we provide a specific definition: 

DEFINITION 6.19. Let X be a connected topological space. A cutset of X 
is a subset S of X such that X — S is disconnected. A outpoint of X is a point 
p € X such that {p} is a cutset of X. A cutset or cutpoint of X is said to 
separate X. 
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EXAMPLE 6.15. The plane M 2 is connected. If we remove the circle S l , we 
are left with two disjoint nonempty open sets as shown in Figure 6.10. Hence, 
R 2 — S l is disconnected, implying that S 1 is a cutset of the plane. 

In fact, the Jordan Curve Theorem, a classic theorem in topology, asserts 
that every simple closed curve in the plane is a cutset of the plane. This general 
result is much more difficult to establish than the specific case of the circle S 1 . 
We prove the Jordan Curve Theorem in Chapter 11. 




FIGURE 6.10: The circle S 1 is a cutset of the plane M 2 . 


As we might expect from the situation depicted in Figure 6.11, if X is a 
connected topological space, then the boundary of a subset A of X is a cutset 
of X if both Int( A) and Int(X — A) are nonempty. (See Exercise 6.23.) 



FIGURE 6.11: The boundary of the set A is a cutset of X. 

The following theorem establishes that cutsets and cutpoints are preserved 
under homeomorphism. Hence we can use them to distinguish between topo¬ 
logical spaces. 

THEOREM 6.20. Let f : X Y be a homeomorphism. If S is a cutset of 
X, then f(S) is a cutset ofY. 

Proof. See Exercise 6.24. B 


It follows immediately from Theorem 6.20 that if / : X -> Y is a home¬ 
omorphism and p is a cutpoint of X, then f(p) is a cutpoint of Y. 
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EXAMPLE 6.16. Every point p in R is a cutpoint. In contrast, no point q in 
the plane S 2 is a cutpoint since R 2 — {q} is connected for all gel 2 . (See 
Figure 6.12.) Thus the line is not homeomorphic to the plane. 



FIGURE 6.12: R - {p} is disconnected, but R 2 - {q} is connected. 


EXAMPLE 6.17. For every q e S l , the space S 1 — {q} is homeomorphic to 
R. Since R is connected, S l — {q} is connected as well. Therefore no point 
in the circle S 1 is a cutpoint. (See Figure 6.13.) But since every point in the 
line is a cutpoint, it follows that the circle is not homeomorphic to the line. 
Furthermore, for every r e S 2 , the space S 2 — {r} is homeomorphic to the 
plane and therefore is connected. Thus no point in the sphere S 2 is a cutpoint, 
implying that the sphere is not homeomorphic to the line. 



FIGURE 6.13: R - {p} is disconnected, but S 1 - { q } and S 2 - [r] are connected. 


EXAMPLE 6.18 . Every two-point set [p, p'} in the circle separates the circle 
into two open intervals, and therefore every two-point set is a cutset of the 
circle. (See Figure 6.14.) However, if we remove a two-point set {q, q'} from 
the sphere, we obtain a space that is homeomorphic to the punctured plane, 
R 2 — { O }. Since R 2 — { O } is connected, it follows that no two-point set in the 
sphere is a cutset. Thus, the circle is not homeomorphic to the sphere. 
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FIGURE 6.14: 5* - { p , p') is disconnected, but S 2 - { q , q'} is connected. 


So far we have distinguished the line from the plane, circle, and sphere, and 
we have distinguished the circle from the sphere. We can similarly distinguish 
the circle from the plane. (See Exercise 6.29.) However, when we consider 
the sphere and the plane, we find that no one-point set, two-point set, or even 
countably infinite set separates either. (See Exercise 6.43.) Therefore, we 
cannot distinguish the sphere from the plane by removing simple sets, as in 
the previous examples. We show in 'Chapter 7 that the topological property 
of compactness enables us to distinguish between these two spaces. (See 
Exercise 7.17.) 

Exercises for Section 6.2 

6.17. (a) Show that open balls and closed balls in the standard metric on R" are 
connected in R". 

(b) Prove that each of the sets, A, B, and C, appearing in Figure 6.15, is a 
connected subset of the plane. (The set C consists of eight open balls and 
the eight points of tangency between them, as shown.) 



FIGURE 6.15: Prove that each of these subsets of the plane is connected. 

6.18. Give examples of subsets A and B in R 2 such that 

(a) A and B are connected, but A 0 B is not. 

(b) A and B are connected, but A — B is not. 

(c) A is connected, B is disconnected, and A fl B is connected. 

(d) A and B are disconnected, but A U B is connected. 

(e) A and B are connected, C1(A) fl C1(B) ^ 0, and A U B is disconnected. 

6.19. Let B c R” be bounded in the Euclidean metric. Prove that the complement 
of B in R" has exactly one unbounded component. 

6.20. In each of the following cases, prove whether or not the given set C is a cutset 
of the connected topological space X: 

(a) C = {b } and X = {a, b, c} with topology {0, [b], {a , b }, { b , c), X). 
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(b) C = {c} and X is the same as in (a). 

(c) C = {0} and X = P PR o> the particular point topology on R with the 
origin as the particular point. 

(d) C = {—1,1} and X = R/ c , the real line in the finite complement topology. 

(e) C = the circle S 1 and X = R 2 . 

(f) C = the core curve in the Mobius band X, as shown in Figure 6.16. 



FIGURE 6.16: Is X - C connected? 

6.21. Prove that each n e Z is a cutpoint of the digital line. 

6.22. Can you cut a Klein bottle into two Mobius bands? Find a cutset C for the 
Klein bottle K such that 

(i) C is a simple closed curve in K, and 

(ii) K — C is a union of two disjoint open sets such that the closure of 
each in K is homeomorphic to a Mobius band. 

Let X be a connected topological space and A be a subset of X. Prove that if 
Int(A) and Int(X - A) are nonempty, then 3 A is a cutset, and the pair of sets, 
Int(A) and Int(X - A), is a separation of X — 3A. 

Prove Theorem 6.20: Let / : X ->■ Y be a homeomorphism. If S is a cutset 
of X, then /(S) is a cutset of Y. 

Prove that for every n > 2 the line is not homeomorphic to R". 

Prove that for every n >2 neither the line nor the circle is homeomorphic to 

S n . 

6.27. Consider [0,1], [0,1), and (0,1) as subspaces of R in the standard topology. 
Prove that no two of these spaces are homeomorphic to each other. (This result 
establishes the claim made in Example 4.12 that, given the three collections of 
intervals defined in the example, there is no homeomorphism from an interval’ 
in one collection to an interval in another.) 

6.28. (a) Prove that the interval [0, 2n) is not homeomorphic to the circle S 1 . (This 

exercise establishes the claim made in Example 4.13 that there is no home¬ 
omorphism between these two spaces.) 

(b) Prove that no interval subspace of R is homeomorphic to the circle S 1 . 

6.29. Prove that no two-point set separates the plane, and use that result to argue that 
the circle is not homeomorphic to the plane. 

6.30. Consider the following definition: 

DEFINITION 6.21. Let X be a connected topological space, and assume that 
n € Z+ with n > 2. A point p in X is said to be a local cutpoint of order 
n in X if there is a connected neighborhood U of p such that U — {p} has n 
components. 

Prove that if / : X —> Y is a homeomorphism and p is a local cutpoint of order 
n in X, then f(p) is a local cutpoint of order n in T. 


6.23. 

6.24. 

6.25. 

6.26. 
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6 . 31 . Begin with a collection of topological graph representations of the letters of the 
alphabet, grouped into equivalence classes by topological equivalence. (See 
Exercise 4.37.) Use Exercise 6.30 to verify that if two representations are in 
different equivalence classes in your grouping, they are not homeomorphic. 


6.3 The Intermediate Value Theorem 

One of the most important theorems in calculus is the Intermediate Value 
Theorem. It indicates that if a continuous function maps a closed interval 
[a, b] to the real line R, then the function takes on every value between the 
values f(a ) and f(b). 

THEOREM 6.22. The Intermediate Value Theorem on [a, b]. Let 

f : [a, b] -*■ R be continuous, and assume that s lies between f (a) and f (b). 
Then there exists at least one c e [a, b] such that f(c) = s. (See Figure 6.17.) 



FIGURE 6.17: There exists c e [a, b] such that /(c) = s. 


The Intermediate Value Theorem on [a, b] follows as a consequence of a 
more general version (Theorem 6.24) that we will subsequently prove. 


EXAMPLE 6.19. Weighing 320 pounds on January 1, 2001, Jared made a 
New Year’s resolution to change his lifestyle in order to lose some weight. 
So he decided to stop having lunch at his favorite burger joint, and he started 
frequenting the sandwich shop across the street. Amazingly, Jared lost 50 
pounds over the next 30 weeks. If we let W (t) be Jared’s weight as a function of 
time t, in weeks, since he started his diet, then W (0) = 320 and W (30) = 270. 
For every weight between 270 and 320, there was a time when Jared weighed 
that much. In particular, there was a time when he weighed 300 pounds. In 
fact, he may have weighed 300 pounds a few times, but the Intermediate Value 
Theorem tells us only that he weighed exactly that much at least once over the 
30-week time interval. 


An example of the utility of the Intermediate Value Theorem is given in 
the following corollary: 
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COROLLARY 6.23. Let f : [a, b] —>■ R be a continuous function such that 
f(a ) and f(b) have opposite signs. Then the equation fix) = 0 has a solution 
between a and b. 

Corollary 6.23 assists us in finding solutions to an equation in the form 
f(x) = 0. For example, if / is a continuous function on R and we find 
an interval where / is negative at one end and positive at the other, then 
Corollary 6.23 guarantees that there is a solution to the equation f(x) = 0 in 
the interval. 

Although the Intermediate Value Theorem on [a, b] is first encountered in 
calculus, the theorem can be stated in much greater generality. It then becomes 
apparent that, in fact, this is a theorem from topology. 

THEOREM 6.24. The Intermediate Value Theorem (General Ver¬ 
sion). Let X be a connected topological space and f : X -> R be con¬ 
tinuous. If p,q e f(X) and p < r < q, then r e f(X). 

Proof. Suppose f : X -*■ R is continuous, p,q e f(X), and p < 
r < q. If r = p or r = q, then we immediately have that r e f(X). 
Therefore, we only need to consider the case where p < r < q. Note 
that f(X) is connected in R since X is connected and / is continuous. 
We prove by contradiction that ref ( X ). Thus, suppose that r g f (X). 
Then U = (—oo, r) and V = (r, oo) are disjoint open subsets of R 
whose union contains f(X). Since p e U and q e V, it follows that 
f(X ) intersects both U and V. Hence, U and V form a separation of 
f(X) in R, as illustrated in Figure 6.18. But this contradicts the fact that 
f(X) is connected in R. Therefore r e f{X). ■ 


U = (~°°, r) 

P 




V = (r, oo) 



FIGURE 6.18: If r £ /(X), then V = (-oo, r) and V = (r, oo) form a separation of 
/(X). 

The Intermediate Value Theorem on [a, b ] follows from the general version 
if we let X be the interval [a, b\ and let p and# be f(a) and / (b), respectively. 


EXAMPLE 6.20. Let the surface of Lake Louise be represented by a topo¬ 
logical space X, and let / : X -»■ R be the depth of the lake beneath each point 
on the surface. The maximum depth of Lake Louise is 70 meters. Is there a 
point where the depth is 30 meters? 

Note that / is a continuous function. Furthermore, X is topologically a 
disk, and therefore is connected. (See Exercise 6.17.) Hence the Intermediate 
Value Theorem applies. There are points where the value of / is 0, namely 
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points on the shore of the lake. There is also at least one point where the value 
of / is 70. Hence, given any depth between 0 and 70, there must be at least 
one point where the lake has that depth. In particular, there is at least one point 
where the depth is 30 meters. 


A useful consequence of the Intermediate Value Theorem is the following 
one-dimensional version of the Brouwer Fixed Point Theorem: 

THEOREM 6.25. The One-Dimensional Brouwer Fixed Point The¬ 
orem. Let f : [—1,1] -*■ [—1,1 ] be continuous. There exists at least one 
c € [— 1, 1] such that f(c) = c. 

This theorem asserts that every continuous function / : [—1, 1] —► [—1, 1] 
maps some c e [— 1,1] to itself. Such a point c is called a fixed point of /. 

We can easily visualize why the theorem holds. (See Figure 6.19.) As we 
trace out the graph of /, going from the left side of the square [— 1,1 ] x [—1,1] 
to the right side, there must be a point ( c, c) where the graph intersects the line 
y = x. Such an intersection corresponds to a value c satisfying f(c) = c, as 
desired. 



FIGURE 6.19: A fixed point of / occurs where the graph of / intersects the line 
y = x. 


We make this pictorial argument formal in the following proof: 

Proof. Let / : [—1, 1] —>• [—1, 1] be continuous. Define a function 
g : [—1,1] -*■ R by g(x) = f(x) — x. The function g is continuous. 
Note that /(— 1) > —1, and therefore g(— 1) > 0. Similarly g(l) < 
0. The Intermediate Value Theorem implies that there exists a value 
c € [—1,1] such that g(c) = 0. For such c it follows that /(c) = c. 
Therefore there exists at least one c in [—1, 1] such that /(c) = c, as we 
wished to show. ■ 


The n-Dimensional Brouwer Fixed Point Theorem states that every con¬ 
tinuous function / : B n -> B n , mapping the n-ball to itself, has a fixed point 
(that is, a point x where f(x) = x). We prove the two-dimensional version of 
it in Chapter 10 when we consider fixed point theory in general. 
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EXAMPLE 6.21 . Application to Population Modeling. In the held of ecol¬ 
ogy, population models of various types—including anywhere from one to 
many species—are investigated both experimentally and mathematically. How 
a species grows and declines, how it interacts with its environment, and how it 
can be managed are all factors that are incorporated into and investigated via 
models. Many mathematical tools are used in developing and analyzing such 
models. When a particular model calls for a qualitative analysis, it is often 
topological tools that are employed. 

Here we consider a straightforward single-species model exhibiting a par¬ 
ticular type of behavior that can be identified via the Intermediate Value The¬ 
orem. 

Let y(t) represent the size of the species population as a function of time 
t. We assume that the population grows at a rate continuously dependant on 
the size of the present population. Thus ^ = f(y ) and / is continuous. The 
function / is the growth function. It reflects how the rate of growth depends 
on the population. 

A simple example of such a model is given by ^ = ky, where k > 0. 
Here the rate of growth is proportional to the size of the population. This model 
applies when a population has no restriction on its resources and no predators. 
The family of solutions to this differential equation is given by y — yoe kl . This 
result yields exponential growth for the population. 

For other instances of the growth function f(y), we may not be able to 
determine the general form of the solution. Approaches other than directly 
solving the equation are then needed to understand the behavior of the solutions. 

In investigating such models, it is often of interest to find solutions that 
are constant in time—also known as steady-state solutions. Such solutions 
can represent the expected long-term behavior of a system, usually appearing 
as the limiting behavior of time-varying solutions. Thus we are interested 
in steady-state solutions because they often correspond to stable states toward 
which other solutions tend. A steady-state solution y (t) = y* occurs whenever 

= 0 and thus at a value y* such that f(y*) = 0. 

Let us consider a specific example. Let y(t) represent the wolf population 
in an area of Montana where wolves are being reintroduced into the wild. We 
would like to know if it is possible for there to be a steady-state population of 
wolves. 

We make two straightforward and natural assumptions about the corre¬ 
sponding growth function /: 

(i) For small numbers of wolves, enough resources are available to 
support the growth of the population. 

(ii) For large numbers of wolves, the resource base is not sufficient to 
sustain growth, and consequently the population declines. 

Thus we assume that f(y) > 0 for small values of y, and that f(y) < 0 
for large values of y. Let us take two such values: S, close to 0, for which 
/(S) > 0, and L, large, for which f(L) < 0. 
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Now, consider the y versus t coordinate system sketched in Figure 6.20. 
Solutions to the differential equation ^ = f(y) have slopes specified by /(y). 
That is, a solution curve passing through the point (f, y) has slope f(y). In 
particular, by our assumptions, the slope of each solution curve is positive at 
y = S and negative at y = L, as indicated in Figure 6.20. 



FIGURE 6.20: Slopes and solution curves described by ^ = /(y) where /(S) > 0 
and f(L) < 0. 

In the fy-plane, let B be the band defined by 

B = l(t,y) 1 1 > 0, S <y < L}. 

It is apparent in Figure 6.20 that solution curves that enter B must remain in 
B. What happens within the band B1 Without more information about the 
behavior of the growth function /, not much can be concluded. However, 
the Intermediate Value Theorem does guarantee that, since /(S) > 0 and 
f(L ) < 0, there exists a value y* € [5, L] such that f(y*) = 0. (See 
Figure 6.21.) Thus there is a steady state solution y(t) = y*. Its graph is a 
horizontal line in the band B. Hence, under the assumptions of growth for small 
population values and decline for large population values, we can conclude that 
there is at least one population value at which the wolves survive in a steady 
state. 



j 
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In the preceding example, we performed a simple qualitative analysis of 
a differential equation. In general, the study of the qualitative behavior of 
solutions of differential equations is a part of the field of dynamical systems. 
(We investigate another aspect of dynamical systems theory in Chapter 8.) Our 
previous analysis can be generalized to a straightforward dynamical systems 
result: Suppose ^ = f{y) is defined on [a, b\, if / is continuous and such 
that f(b) < 0 and f(a) > 0, then there is a steady-state solution y(t) — y* 
with y* 6 [a, b]. This is typical of the type of qualitative conclusions about 
differential equation solutions that can be made using the tools of topology. 
Even if we are unable to specify the solutions to a differential equation, a 
topological analysis often allows us to make conclusions about their behavior. 

The general version of the Intermediate Value Theorem has a variety of 
other interesting implications. For instance, we have the following theorem: 

THEOREM 6.26. Let f : S 2 —»■ R be continuous. There exists c e S 2 such 
that f(c) = f ( c). 

Proof. Let / : S 2 -»• R be continuous. Define g : S 2 -» R by g(x) = 
f(x) — f(—x). The value of gat a point x e S 2 is the difference between 
/ at jc and / at the antipode of x. Note that g is continuous, the domain 
S 2 is connected, and g(x) = —g(—x) for all jc e S 2 . Pick a point p € S 2 . 
If g(p) = 0, then f(p) = f(—p) and we are done. Otherwise, g(p) and 
g(—p) are of opposite sign, and the Intermediate Value Theorem implies 
that there exists q € S 2 such that g{q) — 0. For such q, it then follows 
that f(q) = f(—q). In either case igip) = 0 or g(p) f 0), it follows 
that there exists c e S 2 such that /(c) = /(—c), establishing the desired 
result. ■ 


EXAMPLE 6.22. Theorem 6.26 has a curious application to meteorology. 
Assuming that the surface of the Earth is a sphere, and surface temperature is a 
continuous function defined on the sphere, at any given time there is a point on 
Earth that has the same temperature as the point directly opposite it. Of course, 
we could replace temperature with any other continuous variable defined on 
the Earth, such as barometric pressure, relative humidity, or elevation, to obtain 
a corresponding result. 


Theorem 6.26 can be generalized to a result for continuous functions 
/ : S n -» R m with n > m. This general result is known as the Borsuk-Ulam 
Theorem. Theorem 6.26 addresses the case for n — 2 and m = 1. For m = 1 
and an arbitrary n, the proof of Theorem 6.26 carries over exactly as presented. 
However, the proof of the general Borsuk-Ulam Theorem requires topological 
tools beyond the scope of this text. 

We prove the Borsuk-Ulam Theorem for m — 2 and n = 2 in a set of 
supplementary exercises in Chapter 9. The m = 2 and n = 2 case applies 
to functions / : S 2 -> R 2 and provides a stronger meteorology result than 
the one presented in Example 6.22. Specifically, at any given time there are 
antipodal points on the Earth’s surface that have both the same temperature 
and the same barometric pressure! 
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Exercises for Section 6.3 

6 . 32 . Let T : S 2 -> R be defined by equating the sphere with the surface of the Earth 
and letting T (x) be the temperature at point x on the surface at some given time. 
Assume that T is a continuous function. Show that if 7(Anchorage) = —30° 
and T (Honolulu) = 80°, there is some point on the Earth where the temperature 
is 0°. Does the conclusion necessarily hold if we restrict the domain to the fifty 
United States? Does the conclusion hold in the United States if 7"(Duluth) = 
-30° and 7(Fort Lauderdale) = 80°? 

6 . 33 . Let p(x) be an odd-degree polynomial function. Prove that p(x) = 0 has at 
least one real solution. 

6 . 34 . Determine whether or not the general version of the Intermediate Value Theorem 
holds when the range R is given each of the following topologies: 

(a) The trivial topology 

(b) The discrete topology 

(c) The lower limit topology 

6 . 35 . State and prove an intermediate value theorem for functions mapping into the 
digital line. 

6 . 36 . Suppose that at a given time we measure the intensity of sunlight at each point on 
the Earth’s surface. According to Theorem 6.26, there must be a pair of points 
opposite each other on the Earth’s surface at which the intensity of sunlight is 
the same. However, if it is daytime at one point, it must be nighttime at the 
point opposite it! Resolve the paradox. 

6 . 37 . Show that on every great circle on the surface of the Earth (obtained by inter¬ 
secting the surface of the Earth with a plane through its center), there are two 
opposite points with the same temperature. 

6 . 38 . (a) Let X be connected, and assume a homeomorphism A : X -*■ X exists 

such that A o A(jc) = x for all x e X. Prove that, for every continuous 
function f : X -*■ R, there exists x e X such that f(x) — f(A(x)). 

(b) Use the result from part (a) to prove that somewhere on a glazed doughnut 
there is a point that has the same thickness of glazing as the point obtained 
by 180-degree rotation through the central axis of the doughnut. 


6.4 Path Connectedness 

In this section we introduce path connectedness, the second of the two ap¬ 
proaches to connectedness that we mentioned at the beginning of the chapter. 
We show that path connectedness implies connectedness. Furthermore, we 
exhibit a topological space (the topologist’s whirlpool) that is connected but 
not path connected. Thus path connectedness is a stronger condition on a 
topological space than connectedness. 

Let X be a topological space and x and y be points in X. In Section 4.3 
we defined a path from x to y in X to be a continuous function / : [0, 1] -» X 
such that /(0) = x and /(1) = y. 
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DEFINITION 6.27. A topological space X is path connected if for every 
x, y € X there is a path in X from x to y. A subset A of a topological space 
X is path connected in X if A is path connected in the subspace topology that 
A inherits from X. 

Clearly R" is path connected, as is every open ball and every closed ball 
in E”. 

THEOREM 6.28. IfX is a path connected space, then it is connected. 

Proof. Let X be a path connected space. We prove that X is connected 
by showing that it has only one component, or equivalently that every pair 
of points x, y € X is contained in some connected subset of X. Thus, 
let x and y be arbitrary points in X. Since X is path connected, there is 
a path in X from x to y. The image of such a path is a connected subset 
of X containing both x and y. Therefore every pair of points in X is 
contained in a connected subset of X , and it follows that X is connected. 


Since path connectedness implies connectedness, and since R in the stan¬ 
dard topology is clearly path connected, it might appear that we now have 
a simpler way to prove the connectedness of R than that presented in Theo¬ 
rem 6.17. However, there is a flaw in this argument. The proof of Theorem 6.28 
uses the fact that the interval [0, 1] is connected, and the proof that [0, 1] is 
connected uses the fact that R is connected. Thus the proof of Theorem 6.28 is 
actually built on the connectedness of R. Consequently, Theorem 6.28 cannot 
be used to prove that R is connected. 

On first inspection, it may be natural to think that the properties of con¬ 
nectedness and path connectedness are equivalent, but in fact they are not. In 
this next example we see a topological space that is connected but not path 
connected. 


EXAMPLE 6.23. The topologist’s whirlpool. Let A be the planar curve, 
expressed in (r, 0) polar coordinates as A — 0) \ 0 € (0, oo)}. The 

curve A spirals outward toward the circle S' 1 as shown in Figure 6.22. Define 
W, the topologist’s whirlpool, to be the set W = A U S l . It is straightforward 
to show that W is the closure of A in R 2 . 

Note that A is the continuous image of the connected space (0, oo) and 
therefore A is connected. Since W is the closure of A in R 2 , it too is connected. 

However, W is not path connected. The proof is somewhat intricate, but 
well worth the effort. To prove that W is not path connected, we show that 
every path in W that begins in S 1 must stay in S 1 , and therefore there is no 
path in W from a point in S 1 to a point in A. Thus, let p : [0,1] -» W be 
a path such that p\ 0) e S 1 . We show that p([0,1]) c S 1 , or, equivalently, 
that p~ l (S l ) = [0, 1]. First, note that p _1 (S 1 ) is a nonempty subset of 
[0,1] since p( 0) e S 1 . We claim that p~ l (S l ) is both open and closed in 
[0,1]. Since [0, 1] is connected, it would then follow from Theorem 6.2 that 
p~ l (S l ) = [0, 1]. Thus we need to prove that p _1 (5 1 ) is both open and closed 
in [0, 1], 
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FIGURE 6.22: The topologist’s whirlpool. 


First, note that 5 1 is closed in W and /? : [0,1] —> W is continuous. 
Therefore p~' (S 1 ) is a closed subset of [0, 1]. 

Let B be the basis for the topology on [0, 1] obtained by taking the open 
intervals in K and intersecting them with [0, 1], Each basis element in B is an 
interval subset of [0, 1]. 

To see that p -1 (S 1 ) is an open subset of [0,1], choose an arbitrary point 
y € p -1 ^ 1 )- Thus p(y) € S 1 , and p(y) has a polar-coordinate representation 
of the form (1, y). Let N c M 2 be the polar neighborhood of p(y) given by 

N = {(r, 0)|O.9 < r < 1.1, y -0.1 < 9 < y +0.1}. 

(The specific numerical values in this choice of N are not significant; we just 
need a small neighborhood of p(y).) The set N' — W n N is an open set 
in W. Thus p~ l (N r ) is open in [0, 1] and contains y. Therefore there is a 
basis element / € B such that ye/ c p~ l (N'). Being an interval, the basis 
element J is connected. Hence p(J) is a connected subset of N'. We show 
that p(J) cS'fl N'. Notice that p(J) has at least one point in common with 
S 1 n N', namely p(y). 

Now, in the interval (0.9, 1) we can choose a sequence (r„) that converges 
to 1 and is such that the sets 

U n = {(r, 9) € N | r > r n } and V n = {(r, 9) e N \ r < r n } 

form a separation of N' in the plane. (See Figure 6.23.) Lemma 6.7 implies 
that p(J) C U n for all n. Thus p(J) C S 1 n N'. Therefore we have y € J C 
p _1 (S 1 ), and / is a basis element for the topology on [0, 1], This implies that 
p -1 ^ 1 ) is open in [0, 1], as we wished to show. 

Thus W is not a path connected space. As we have already indicated, it is 
connected. Therefore the topologist’s whirlpool is an example of a topological 
space that is connected but not path connected. 




EXAMPLE 6.24. The topologist’s sine curve is the subspace of R 2 defined 
by 

T = {(.x, sin(i) | 0 < * < 1} U {(0, y)\ - 1 < y < 1}. 

It is shown in Figure 6.24. The topologist’s sine curve is another space that is 
connected but not path connected. (See Exercise 6.42.) 



FIGURE 6.24: The topologist’s sine curve. 


It is not difficult to see that if we poke a hole in R 2 , the resulting space is 
path connected. Specifically, let p be a point in R 2 and consider R 2 — {p}. If 
a and b are points in R 2 — {p}, and p does not lie on the line segment between 
a and b, then there is a straight-line path from a to b in R 2 — {p}. It is given 
by /(f) = (1 — t)a + tb. Otherwise, we can find a path between a and b in 
R 2 — {p} by choosing a third point c that is not on the line through a, b, and 
p, and following a two-segment dogleg, from a to c to b, around the missing 
point p. (See Figure 6.25.) Such a path is given by 

1 (1 — 2 t)a + 2 tc for 0 < t < 1/2, 

(2 — 2t)c + (2t — V)b for 1/2 < f < 1. 


g(0 = 
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FIGURE 6.25: The space R 2 - {p} is path connected. 


Note that g is continuous by the Pasting Lemma. Thus R 2 — {/>} is path 
connected. 

The preceding argument can also be used to show that for every n > 2 and 
p 6 R'\ the space R" — [p] is path connected. Moreover, we need not limit 
ourselves to one pinprick in R". Even putting a countably infinite number of 
pinpricks in R" results in a space that is path connected. (See Exercises 6.43 
and 6.44.) 

The following theorem indicates that the image of a path connected space 
under a continuous function is path connected. This implies that path con¬ 
nectedness is preserved under homeomorphism and therefore is a topological 
property. 

THEOREM 6.29. Assume that f : X -* Y is continuous and X is path 
connected. Then f(X ) is a path connected subspace ofY. 

Proof. Let pandq be points in f(X). Pick points* e / -1 ({/?)) and y e 
/~* ({<?})• Since X is path connected, there exists a path g : [0, 1] -> X 
from * to y. Then / o g is a path in /(X) from p to q. ■ 

Now, as is the case with connectedness, a topological space X can be 
partitioned into maximal path connected subsets. These are known as the path 
components of X. We make this precise in what follows. 

Given a topological space X, define a relation on X by * ~ p y if there 
exists a path in X from * to y. This relation is an equivalence relation, which 
we now check. 

(i) Is it true that * ~ p xl Yes, we can take the path / : [0, 1] -» X 
given by f(t) = * for all t. 

(ii) Is it true that x ~ p y implies y ~ p xl Yes. If / : [0,1] —► X is a 
path from * to y, then g : [0,1 ] -» X, defined by g(t) = /(I — t), 
is a path from y to x. 

(iii) Suppose * ~ p y and y ~ p z- Is it true that * ~ p z? Yes. We 
know that there are paths / and g between * and y and between y 
and z, respectively. Form a new path h : [0,1] X, defined by 


hit) = 


/(2f) for 0 < r < 1 /2, 
g(2t - 1) for 1/2 < f < 1. 
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Then h( 0) = x and /i(l) = z. Since /(1/2) = y = g( 1/2), the 
Pasting Lemma implies that h is continuous. Thus, h is a path from 
x to z. 

Therefore ~p is an equivalence relation. As usual, the equivalence classes 
form a partition of X. 

DEFINITION 6.30. The equivalence classes under the equivalence relation 
~ p are called the path components ofX. 

The path components of X are path connected, and every path connected 
subset of X is a subset of a path component of X. (See Exercise 6.47.) In 
this sense, the path components of a topological space X are the maximal path 
connected subsets of X. 


EXAMPLE 6.25. Let X n be the subset of the plane consisting of the spiral 
{(g+T. <?) € M 2 | 0 € (0, oo)} and n equally-spaced points in the circle S' 1 . We 
illustrate X$ in Figure 6.26. With arguments similar to those in the topologist’s 
whirlpool example, we can show that X„ is connected but not path connected. 
It follows that X„ has one component, and it is straightforward to prove that 
X„ has n + 1 path components. 



FIGURE 6.26: The set X$ consists of a spiral and six equally-spaced points in the 
circle surrounding it. 


j 


It is generally true that the number of components of a topological space 
does not exceed the number of path components since each path component 
must be a subset of some component of the space. (See Exercise 6.49.) 

Exercises for Section 6.4 

6.39. Let X = [a, b } and assume that X has the topology T = [0, [a], X}. Show 
that X is path connected in this topology. 

6.40. Prove that the digital line is path connected. 

6.41. A set C c R" is said to be star convex if there exists a point p* e C such that 
for every p e C, the line segment in R" joining p* and p lies in C. Prove that 
if C c R" is star convex, then C is path connected in R". 
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6.42. Prove that the topologist’s sine curve is connected but not path connected. 

6.43. Let A = R 2 — C where C is a countable subset of R 2 . Prove that A is path 
connected. (Hint: Given a point a 6 A, there are many lines through a that 
miss C entirely. Why?) 

6.44. Let A = R" — C where C is a countably infinite subset of K". Prove that A 
is path connected. (Hint: Prove this by induction, using Exercise 6.43 as an 
initial step.) 

6.45. Prove that the n-sphere S n is path connected for n > 1. (Hint: Find a surjective 
continuous function from a path connected space to S ".) 

6.46. Prove that if [A a } ae A is a collection of path connected subsets of a topological 
space X, and HaeA is nonempty, then (J a£j4 A a is path connected. 

6.47. Let X be £ topological space. 

(a) Prove that each path component of X is path connected. 

(b) Prove that each path connected subset of X is a subset of some path com¬ 
ponent of X. 

6.48. Provide an example showing that the path components of a topological space 
X need not be open sets in X, nor closed sets in X. 

6.49. Prove that each path component of a topological space X is a subset of some 
component of X. 

6.50. Provide an example of a connected topological space having uncountably many 
path components. 

6.51. Prove that if / : X —> Y is a homeomorphism and C is a path component of 
X, then /(C) is a path component of Y. 

6.52. Let .Xj,... , X n be path connected. Prove that the corresponding product space 
X\ x ... x X n is path connected. 


6.5 Automated Guided Vehicles 

Automated guided vehicles are mobile robots that are used to transport materi¬ 
als from location to location in a manufacturing facility. Part of the challenge 
in designing and constructing such a facility is properly setting up mobile robot 
routes so that the robots can maneuver in an efficient and safe manner. The 
tools and concepts of topology are naturally employed in this planning process. 
In this section we consider some simple examples. 

We model each robot with a point that moves through a topological space 
representing the robot routes in the factory. To begin, suppose we have two 
robots A and B that move on a line represented by R, as shown in Figure 6.27. 
Let xa indicate the location of robot A and x# the location of robot B. The 
configuration space for the two robots (see Section 3.5) is the space 

C = {( xa , xb) | xa e R, xb e R} = R x R = M 2 . 
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X A X B 



FIGURE 6.27: Two robots moving on the line yield a configuration space that is R 2 . 

To prevent robot collisions, we do not allow the two robots to occupy the 
same point at the same time. That is, we never let xa = xb- The resulting 
space of permitted configurations is then 

SC = {Cm, xb ) | M € K, xb g K, xa 7 ^ xb }- 

To distinguish this space from the configuration space C, we call this the safe 
configuration space for the robots. 

The points in the plane that are excluded from the safe configuration space 
comprise the set {(jc, jc) | x 6 R). This set is called the diagonal of the plane 
and is denoted by A. Note that SC = C — A. In this case the safe configuration 
space is the plane with the diagonal removed. (See Figure 6.28.) It is important 
to note that this space is not connected and therefore not path connected. We 
will shortly see the significance of this fact. 



FIGURE 6.28: The safe configuration space for two robots on the line is the plane 
with the diagonal removed. 

In the general case, suppose that we have n robots in space X. The safe 
configuration space is defined to be the space 

SC n (X) = (X x X x ... x X) - A, 


where 


A = {(*i ,X2,... , x n ) | xi = xj for some i ^ j}. 
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Here we are taking the product of n copies of X and removing the diagonal A. 
As in the two-robot case, by removing the set A we are excluding configurations 
where two or more robots occupy the same point and are therefore eliminating 
the possibility of robot collisions. 

To model a relocation of the robots in space X, we use a path in SC n (X). 
Specifically, assume that our n robots are initially positioned so that robot j is 
located at I j, and we wish to move the robots so that the final position of robot 
j is Fj. We define a relocation of the n robots from the initial configuration 
/ = to the final configuration F — (F\,... , F n ) to be a path 

p : [0, 1] -» SC n (X ) such that p( 0) = I and p(l) = F. The existence of 
such a path ensures that the robots can be moved from the initial configuration 
to the final configuration without collision. 

Given two configurations of the robots, M — (M\ ,... , M n ) and N — 
(N\,... , N n ), we say that N is attainable from M if there is a relocation of 
the robots from M to N. If the safe configuration space is path connected, 
then every configuration is attainable from every other one, and we say that the 
robots are freely transportable in X. 


EXAMPLE 6.26. Returning to the example of two robots on the line, the safe 
configuration space is SC 2 (R) = R 2 — A. As we already observed, this space 
is not path connected, so the two robots are not freely transportable in R. This 
is intuitively clear, since no configuration with robot A to the left of robot B is 
attainable from any of the configurations with robot B to the left of robot A. 

This intuitive observation is reflected in the structure of the safe configu¬ 
ration space. In particular, the path components of SC 2 (R) are the sets 


U = {(xa,xb) | xa < xb e R} and V = {(^, xb ) I xa > x B e R], 

which lie above and below the diagonal, respectively, in Figure 6.28. The set 
U represents all of the configurations where robot A lies to the left of robot 
B, and the set V represents all of the configurations where robot B lies to the 
left of robot A. Since there is no path in 5C 2 (R) from U to V, it follows—as 
we have observed intuitively—that no configuration with robot A to the left 
of robot B is attainable from any of the configurations with robot B to the 
left of robot A. Nonetheless, since there is a path in U between every pair of 
configurations in U, it follows that each configuration with robot A to the left 
of robot B is attainable from every other such configuration via a relocation 
that keeps robot A to the left of robot B. The same holds true for each pair of 
configurations in V. 

In Figure 6.29 we illustrate two relocations of the two robots from an 
initial configuration {I a, Ib ) to a final configuration (Fa, Fb ) in U. Since the 
configurations are in U, robot B is to the right of robot A. In the first relocation, 
both robots move to the right, with robot B traveling further overall than robot 
A, a situation that is reflected in the fact that the slope of the segment between 
(I a, Ib) and (Fa, Fb) is greater than 1. In the second relocation, the robots 
move toward each other, and since they are moving in opposite directions, the 
slope of the segment between (I a, Ib) and (Fa, Fb) is negative. 




EXAMPLE 6.27. Consider two robots on the circle S 1 . Here the safe con¬ 
figuration space 5C 2 (S‘) is a torus, S 1 x S ] , with the diagonal removed. To 
picture the safe configuration space, consider Figure 6.30. There we represent 
the torus as a square with opposite edges identified. The removed diagonal 
is represented by the dashed line. Removal of the diagonal results in a space 
formed by gluing two triangles along the edges as shown. The corresponding 
space is homeomorphic to (0, 1) x S 1 , an open annulus. 



FIGURE 6.30: The safe configuration space SC 2 (S*) is an open annulus. 

An open annulus is path connected since it is homeomorphic to a product 
of two path connected spaces ((0, 1) and S 1 ) and since the product of path 
connected spaces is path connected. (See Exercise 6.52.) Therefore we can 
conclude that two robots on S 1 are freely transportable. 


EXAMPLE 6.28. This is based on an example from [Abr2], Consider the 
configuration space for two robots A and B on the F-shaped topological graph 
shown in Figure 6.31. We denote the space by F and the three edges of Y by 
a , f3, and y. The three edges intersect at the vertex v. 

The safe configuration space is SC 2 (F) = (Y x Y) — A. In order to 
picture SC 2 (F) we break it up into three subsets: (F x a) — A, (F x /3) — A, 
and (F x y ) — A, then we examine how the three subsets glue together to form 
SC 2 (F). 
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FIGURE 6.31: Two robots on the space Y. 


In Figure 6.32 we depict (F x a) — A. The set of configurations cor¬ 
responding to robot B fixed at v and robot A located anywhere else on Y is 
represented by the edges at the bottom of the pictured space. Along the edge 
marked with a single arrow, robot B is fixed at v and robot A lies along edge 
a. We can interpret the edges marked with a double or triple arrow in a similar 
fashion. Moving vertically in the illustration of (F x a) — A corresponds to 
moving robot B outward along edge a and keeping robot A fixed. Because 
we exclude the diagonal line segment that corresponds to the robots coinciding 
on edge a , there are two components to (F x a) — A: the triangle below the 
diagonal line segment, and the rest of (F x a) — A, as shown in the figure. 


a 



FIGURE 6.32: The subset (Y x a) - A of the safe configuration space (F x Y) — A. 

In Figure 6.33 we show (Y x /J) — A and (F x y) — A. The situation 
there is similar to that shown in Figure 6.32. It is important to realize that 
the edges marked with a single arrow in (F x /5) - A and (F x y) - A 
correspond to the edge marked with a single arrow in (F x a) — A. Each of 
these edges corresponds to the configurations where robot B is fixed at v and 
robot A is somewhere on edge a. Therefore these edges are glued together 
in the construction of (F x F) — A from (F x a) — A, (F x /l) — A, and 
(F x y) — A. In a similar manner, the edges marked with double arrows are 
glued together in the construction of (F x F) — A, as are the edges marked 
with triple arrows. 
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FIGURE 6.33: The spaces (Y x /?) — A and (K x y) — A. 

Taking (Y x a) — A, (Y x fi) — A, and (Y x y) — A and gluing them 
together so that the arrows appropriately match, we obtain the safe configura¬ 
tion space SC 2 (Y) = (F x Y) — A shown in Figure 6.34. The parts labeled 
1, 2, and 3, come from (Y x a) — A, (Y x fi) — A, and (Y x y) — A, respec¬ 
tively. The hole in the middle is the deleted diagonal point corresponding to 
having both robots located at v. This safe configuration space SC 2 (Y) is path 
connected. Therefore a pair of robots is freely transportable on Y. 



FIGURE 6.34: The safe configuration space SC 2 (Y ) = (T x Y) — A. 

In Figure 6.35, looking directly down on SC 2 (Y ) as shown in Figure 6.34, 
we depict a relocation of the robots that switches their positions, starting with 
robot A near the end of a and robot B near the end of p. The path corresponds 
to the following maneuver of the robots: 

(i) Keeping robot B fixed, move robot A to point v and then onto 
edge y. 

(ii) Keeping robot A fixed, move robot B to point v and then out to its 
final position on edge a. 

(iil) Keeping robot B fixed, move robot A back to point v and then out 
to its final position on edge fi. 
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FIGURE 6.35: A relocation that switches the positions of two robots on Y. 

_i 


Exercises for Section 6.5 

6.53. Consider SC 2 (R), the safe configuration space for two robots on a line. For 
each of the following cases, sketch a line segment in SC 2 (R) illustrating the 
described relocation: 

(a) Robot B is to the left of robot A and they move in opposite directions, 
toward each other. 

(b) Robot B is to the left of robot A and they both move to the left, with robot 
B traveling further than robot A. 

(c) Robot A is to the left of robot B and they move in opposite directions, 
away from each other. 

(d) Robot A is to the left of robot B and they both move to the right, with 
robot A traveling further than robot B. 

6.54. Describe a general procedure for performing a relocation from any given con¬ 
figuration of two robots on the circle to any other configuration. 

6.55. Describe the safe configuration space for three robots moving on R, indicating 
how many different path components make up the space and discussing which 
relative positions of the robots correspond to each path component. 

6.56. Suppose that we have two robots on the line, and there is a particular coordinate 
jco on the line that they cannot occupy. Sketch and describe the safe configu¬ 
ration space for this situation. Indicate how many different path components 
make up the space and discuss which relative positions of the robots correspond 
to each path component. 

6.57. Sketch a path on S'C 2 (F) representing a relocation that switches the robots’ po¬ 
sitions, starting with both robots located on edge y. Describe the corresponding 
robot maneuvers. 

6.58. Consider SC 2 (P), the safe configuration space for two robots on the space P 
shown on the left in Figure 6.36. 

(a) The space SC 2 (P) can be obtained as a quotient space of SC 2 (Y) by 
gluing together parts of SC 2 (Y). On an illustration of SC 2 (Y), indicate 
which parts are glued together to obtain SC 2 (P). (Hint: Think about how 
to obtain P from Y by identifying points on Y.) 
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(b) On a representation of SC 2 (P), sketch a path illustrating a relocation that 
switches the robots’ positions, starting with robot A at the top of P and 
robot B at the bottom. Describe'the corresponding robot maneuvers. 

6.59. Consider SC 2 (X), the safe configuration space for two robots on the space X 
shown on the right in Figure 6.36. 

(a) Sketch a representation of SC 2 ( X ), showing how it can be constructed by 
gluing together subsets in a manner similar to our construction of SC 2 (F). 

(b) On your representation of SC 2 (X), sketch a path illustrating a relocation 
that switches the robots’ positions, starting with robot A at the top left of X 
and robot B at the top right. Describe the corresponding robot maneuvers. 




FIGURE 6.36: The spaces P and X. 
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So far we have introduced two of the “Three Cs” of introductory topology: 
continuity and connectedness. In this chapter, we present the third: compact¬ 
ness. This concept is not as intuitive as continuity or connectedness. In R", 
the compact sets are the closed and bounded sets, but in a general topological 
space the compact sets are not as simple to describe. In fact, a formal definition 
for compactness in a general topological space took some time for topologists 
to establish. Several definitions were suggested during the development of 
topology in the early twentieth century. Ultimately, topologists settled on the 
definition of compactness proposed in 1923 by Pavel Sergeevich Alexandroff 
(1896-1982) and Pavel Urysohn; it is presented here in the first section. 

We define compactness and examine some related results and examples 
in Section 7.1. Compactness in metric spaces is addressed in Section 7.2. In 
Section 7.3, we prove the Extreme Value Theorem and then use it to establish 
a number of other useful results. In Section 7.4, we investigate limit point 
compactness, a property that is closely related to compactness. In particular, 
we prove that in a metric space, limit point compactness is equivalent to com¬ 
pactness. Finally, in Section 7.5, we introduce the one-point compactification, 
a construction that enables us to view a noncompact space X as a subspace of 
a compact space that is obtained by adding a single point to X and defining an 
appropriate topology. 

7.1 Open Coverings and Compact Spaces 

Consider the subsets of M 2 in the standard topology depicted in Figure 7.1. Our 
goal in this chapter is to establish the distinction between the subsets in the 
figure that are labeled “compact” and the subsets that are labeled “noncompact.” 



FIGURE 7.1: Compact and noncompact subsets of R 2 . 
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We start with some important definitions associated with compactness. 

DEFINITION 7.1. Let Abe a subset of a topological space X, and let O be 
a collection of subsets of X. 

(i) The collection O is said to cover A or to be a cover of A if A is 
contained in the union of the sets in O. 

(ii) If O covers A, and each set in O is open, then we call O an open 
cover of A. (See Figure 7.2.) 

(iii) If O covers A, and O' is a subcollection of O that also covers A, 
then O' is called a subcover of O. 



FIGURE 7.2: An open cover of A. 

By the definition of a basis, it follows that every basis for a topological 
space X is an open cover of X. 

The two collections of intervals 

Oi ={..., (-1,1), (0, 2), (1, 3),...} and 0 2 = {(-oo, 1), (0, oo)} 

are both open covers of R. As we see from these collections, we can have open 
covers that consist of infinitely many sets, and we can have open covers that 
consist of finitely many sets. We are interested in spaces for which every open 
cover containing infinitely many sets can be reduced to a subcover containing 
finitely many sets. Specifically, we have the following definition: 

DEFINITION 7.2. A topological space X is compact if every open cover of 
X has a finite subcover. 


EXAMPLE 7.1. The real line E in the standard topology is not compact since 
0 = {... ,(-1,1), (0,2), (1,3),...} 

is an open cover, but no finite subcollection of O covers R. (See Figure 7.3.) 
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FIGURE 7.3: An open cover of R with no finite subcover. 
_ 1 


EXAMPLE 7.2. Let X = {* 1 , ... , je„ } be a topological space that contains 
only finitely many points. Then X is compact since there can be only finitely 
many open sets in the topology on X, and therefore every open cover of X is 
already finite. 


We extend the definition of compactness to subsets of a topological space, 
as follows: 

DEFINITION 7.3. Let X be a topological space, and assume A C X. Then A 
is said to be compact in X if A is compact in the subspace topology inherited 
from X. 

The following lemma allows us to check whether or not a subspace A of 
a topological space X is compact by considering covers of A that are made up 
of open sets in X rather than covers of A that are made up of open sets in the 
subspace topology on A: 

LEMMA 7.4. Let X be a topological space, and assume A C X. Then A is 
compact in X if and only if every cover of A by sets that are open in X has a 
finite subcover. 

Proof. Let A be compact in X, and suppose that O is a cover of A by 
open sets in X. Then O' = [U D A \ U e Oj is a cover of A by open sets 
in A. Hence, there exists a finite subcover [U\ fl A, U 2 HA,... ,f/„nA} 
of O'. But then {t/i, U 2 , ■■■ , U n } is a finite subcover of O. Therefore 
every cover of A by open sets in X has a finite subcover. 

Conversely, suppose every cover of A by sets that are open in X 
has a finite subcover. Let O = {Vp}p e B be a cover of A by open sets 
in A. Then, by definition of the subspace topology, for each Vp there 
is an open set Up in X such that Vp = Up fl A. It follows that the 
collection O' = {Up}p e B is a cover of A by open sets in X. Since O' 

has a finite subcover {(/^. Up n ], it follows that {V^ 1 . VpJ is a 

finite subcover of O. Thus every cover of A by open sets in A has a finite 
subcover, and therefore A is compact. ■ 
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EXAMPLE 7.3. The subset A — {0} U {£ | n e Z+} is compact in R. To 
see this, let O be a cover of A by open sets in R. There exists at least one 
open set Uq in O that contains the point 0. Such an open set contains all but 
at most finitely many of the points in A. (See Figure 7.4.) If Uq contains all 
of the points in A, then Uo, by itself, is a finite subcover of O. Otherwise, let 
1/m be the smallest of the points in A that are not in Uq. For each point \/i, 
there is an open set Uj in O that contains it. It follows that the finite collection 
[Uq, U\,U m ) is a subcover of O. Thus, A is compact in R. 


FIGURE 7.4: The set Uq contains all but finitely many points from A. 


EXAMPLE 7.4. Consider (0, 1] as a subspace of R. This space is not com¬ 
pact. The collection O = {(£, 2) | n £ Z+} is a cover of (0, 1] by sets that 
are open in R. There is no finite subcollection of O that covers (0,1], and 
therefore (0, 1] is not compact as a subspace of R. 


Two homeomorphic spaces are either both compact or both noncompact. 
Every open cover O of one is sent to an open cover O' of the other by a home- 
omorphism. Similarly, every finite subcover of O is sent to a finite subcover 
of O' by the homeomorphism. 

Since R and (0, 1] are not compact, it follows that every interval of the form 
(a, b), (—oo, b), ( a , oo), [a, b), [a, oo), (a, b\, or (—oo, b ] is not compact as 
well. What about intervals in the form [a, b]l In the next section, we will 
show that these intervals are compact in the standard topology. 

We actually do not need the full strength of a homeomorphism to ensure 
that a compact space maps to a compact space. The next theorem asserts that, 
as with connectedness and path connectedness, compactness is preserved by 
continuous functions. 

THEOREM 7.5. Let f : X — »■ Y be continuous, and let A be compact in X. 
Then f(A) is compact in Y. 

Proof. Let / : X -> Y be continuous, and assume that A is compact 
in X. To show that f{A) is compact in Y, let O be a cover of / (A) by 
open sets in Y. Then f~ l (U) is open in X for every open set U in O. 
Hence O' — {/ _1 (U) \U e O} is a cover of A by open sets in X. Since 
A is compact, Lemma 7.4 implies that there is a finite subcollection of 
O', say {/”' (Uj ),..., f~ l (U„)), that covers A. Then the collection of 
open sets [U \,..., U n ) in O covers /(A). Thus, O has a finite subcover, 
implying that /(A) is compact in Y. ■ 
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One immediate consequence of Theorem 7.5 is that a quotient space of a 
compact space X is compact since it is the image of X under a quotient map, 
and a quotient map is a continuous function. 

The following theorem provides some results on the compactness of unions 
and intersections of compact sets: 

THEOREM 7.6. Let X be a topological space. 

(i) If C i,... ,C n are each compact in X, then (J" =1 Cj is compact 
in X. 

(ii) If X is Hausdorff, and {C a } a6 A is a collection of sets that are 
compact in X, then HaeA is compact in X. 

Proof. See Exercise 7.2. ■ 


An arbitrary union of compact sets need not be compact. (See Exercise 7.3.) 
Also, if we drop the assumption that X is Hausdorff in the second part of 
Theorem 7.6, then an intersection of compact sets need not be compact. (See 
Exercise 7.18.) 

The next two theorems show that being closed and being compact are 
closely related properties. 

THEOREM 7.7. Let X be a topological space and let D be compact in X. If 
C is closed in X, and C c D, then C is compact in X. 

Proof. Let D be compact in the topological space X. Suppose that C is 
closed in X and C C D. Further, suppose that O is a cover of C by sets 
that are open in X. The set X — C is open. Add X — C to the collection 
O to obtain a new collection O' = O U {X — C}. (See Figure 7.5.) The 
collection O' is an open cover of X and therefore is an open cover of D. 
Since D is compact in X, there is a finite subcollection of O' that covers 
D. The set C is covered by those sets in the finite subcover of O' that 
were originally in O. Therefore there is a finite subcollection of O that 
covers C, implying that C is compact in X. ■ 



FIGURE 7.5: Adding X - C to an open cover of C results in an open cover of the 
whole space X. 
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Theorem 7.7 indicates that a closed subset of a compact space is compact. 
The converse relationship, however, does not generally hold. A compact set in 
a topological space is not necessarily a closed set, as demonstrated in the next 
example. 


EXAMPLE 7.5. ConsiderR /c , the real line in the finite complement topology. 
Every subset of R/ c is compact. (See Exercise 7.1.) Aside from the whole set 
itself, every other infinite set A c R/ c is not closed, since the complement of 
such a set A is not open. Therefore there are subsets of R/ c that are compact 
but not closed. 


Although compact sets in a topological space are not necessarily closed, the 
following theorem indicates that there are general circumstances under which 
compact sets are automatically closed. 

THEOREM 7.8. Let X be a Hausdorff topological space and A be compact 
in X. Then A is closed in X. 

Proof. Let A be compact in the Hausdorff space X. To show that A is 
closed, we prove that X — A is open. Thus, let x € X — A be arbitrary. 
We show that there is an open set U such that x e U C X — A. 

Since X is Hausdorff, we know that for each a e A, there exist 
disjoint open sets U a and V a such that x e U a and a e V a . (See 
Figure 7.6.) Then O = {V a } a eA is an open cover of A. Because A is 
compact, there is a finite subcover [V ai ,..., V Un } of O. Let V = V a , 
and U = fj/Li Ua, ■ Then U and V are open sets such that A C V and 
x € U. Furthermore, since U aj and V aj are disjoint for each i, it follows 
that U and V are disjoint as well. Thus U and A are disjoint, and therefore 
there exists an open set U such that x e U c X — A, as we wished to 
show. Hence X — A is open, implying that A is closed. ■ 



FIGURE 7.6: Using the Hausdorff property to separate x from points in A. 
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The next lemma will help us subsequently prove that a finite product of 
compact spaces is compact. 

LEMMA 7.9. The Tube Lemma. Let X and Y be topological spaces, and 
assume that Y is compact. Ifx € X, and U is an open set in X x Y containing 
{jc} x Y, then there exists a neighborhood W ofx in X such that W x Y C U. 

The lemma asserts that if an open set in X x Y contains a slice {x} x Y of 
X x Y, then there is an open tube W x Y containing the slice and contained 
in the open set. (See Figure 7.7.) Notice that, without the assumption that Y 
is compact, the lemma does not necessarily hold. (See Exercise 7.12.) 



FIGURE 7.7: The tube WxKis contained in the open set U. 


Proof. For each y e Y pick open sets W y in X and V y in Y such that 
(x, y) € W y x V y c U. The collection of sets {V y } y€ y is an open 
cover of Y. Since Y is compact, finitely many of these sets cover Y, 
say V n , ... , V yn . Let W = f|"=i Wy,. Then W is open in X, and W 
contains x because each W y contains x. Note that 

n 

W xY c 1J( W * x v yi) C U. 

i=i 

Therefore W x Y contains {x} x Y and is contained in U, as desired. 


THEOREM 7.10. IfX and Y are compact topological spaces, then the prod¬ 
uct X x Y is compact. 

Proof. Let O be an open cover of X x Y. For each x € X, the set (x) x Y 
is compact in X x Y. Therefore a finite subcollection O x of O covers 
{x} x Y. Let U x be the union of the sets in O x . The set U x is open in 
X x Y and contains {x} x Y. By Lemma 7.9, for each x e X there exists 
an open set W x C X such that x e W x and W x x Y C U x . Note that O x 
covers W x x Y. 
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The collection W = [W x \ x e X) is an open cover of X. Since 
X is compact, it is covered by finitely many of the sets in W, say 
W Xl ,... , W Xm . It follows that C = 0 X] U ... U 0 Xm covers X x Y. 
The collection C is a subcollection of O and is finite, being a finite union 
of finite sets. Therefore O has a finite subcollection that covers XxF, 
implying that X x Y is compact. B 

Recall from Theorem 3.9 that if A and B are subsets of topological spaces 
X and Y, respectively, then the product of the subspace topologies that they 
inherit from X and Y is the same as the subspace topology that A x B inherits 
from X x Y. In particular. Theorems 3.9 and 7.10 together imply that if A is a 
compact subset of X, and B is a compact subset of Y, then Ax B is a compact 
subset of X x Y. Using this result and induction, we then obtain the following 
corollary: 

COROLLARY 7.11. Let X\,... ,X n be topological spaces, and let A ,• be a 
compact subset of Xi for each i = 1,... , n. Then A\ x ... x A n is a compact 
subset of the product space X\ x ... x X n . 

Exercises for Section 7.1 

7.1. Show that every set A c R is a compact subset of R in the finite complement 
topology on R. 

7.2. Prove Theorem 7.6: Let X be a topological space. 

(a) If Ci,... , C„ are each compact in X, then (J”=i Cj is compact in X. 

(b) If X is Hausdorff, and {C a }aeA is a collection of sets that are compact in 
X, then riaeA C<* is compact in X. 

7.3. Provide an example demonstrating that an arbitrary union of compact sets in a 
topological space X is not necessarily compact. 

7.4. Prove that Z with the digital line topology is not compact. 

7.5. Let A = D — { O), the disk in R 2 with the origin removed. Show that A is not 
compact. 

7.6. Consider the topology on Z generated by the basis B = {(—n, n) \ n e Z+). 

(a) Determine whether or not (—5, 5) is a compact subset of Z in this topology. 

(b) Determine whether or not Z is compact in this topology. 

7.7. Recall that the arithmetic progression topology on Z is generated by the basis 
B = {A a ,6 | a, b € Z, b fk 0}, where each 

A a ,b = {..., a — 2b, a — b, a, a + b, a + 2b ,...} 

is an arithmetic progression. Determine whether or not Z is compact in this 
topology. 

7.8. Let X be a compact topological space, and let {C, }; € z + be a collection of 
nonempty closed sets in X satisfying C,+i C C,- for each i 6 Z+. Prove that 

n~i c,*0. 
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7.9. Prove that if X is compact and Hausdorff, then X is normal. (Hint: First prove 
that X is regular.) 

7.10. Prove the converse of Theorem 7.10: If X x Y is compact, then so are X and Y. 

7.11. (a) Let / : X -» Y be a continuous bijective function. Prove that if X is 

compact and Y is Hausdorff, then / is a homeomorphism. 

(b) Provide an example where / : X -> Y is a continuous bijective function 
and X is compact, but / is not a homeomorphism. 

(c) Provide an example where / : X —> Y is a continuous bijective function 
and Y is Hausdorff, but / is not a homeomorphism. 

7.12. Show that the Tube Lemma does not necessarily hold if we drop the assumption 
that Y is compact. That is, provide an example of a noncompact space Y and 
an open set U in X x Y such that U contains a slice {x} x Y c X x Y but does 
not contain an open tube IfxF containing the slice. 

7.13. Let f : X —> Y be a function. The graph of / is the subset ofX x f given by 
G = {(x, /(*)) | x e X). 

(a) In Exercise 4.10 we established that if / : X -*■ Y is continuous and Y is 
Hausdorff, then G is a closed subset of X x Y. Here we prove a converse. 
Assume that X and Y are topological spaces and Y is compact. Show that 
if G is a closed subset of X x Y, then / is continuous. (Hint: Given U 
open in Y, and x e f~ l (U), show that the slice {x} x Y is contained in 
the open set (X - G) U (X x U), and then apply the Tube Lemma.) 

(b) Show that the result from part (a) does not hold if we drop the assumption 
that Y is compact. That is, find an example of a noncompact space Y and a 
function / : X -+ Y such that the graph of / is a closed subset of X x Y, 
but / is not continuous. 


7.2 Compactness in Metric Spaces 

In real analysis, where the focus is on R” with the standard metric and topol¬ 
ogy, sometimes a set is defined to be compact if it is closed and bounded. In 
this section, we show that in R" such a definition is consistent with the topo¬ 
logical definition already presented. We then address the extent to which this 
equivalence carries over to general metric spaces. Following that, we present 
some important convergence properties of compact sets in a metric space. 

We begin with the following lemma, which we subsequently use to show 
that closed and bounded intervals in R are compact: 

LEMMA 7.12. The Nested Intervals Lemma. Let {[a„, b„]} n€ % + be a 
collection of nonempty closed bounded intervals in R such that [a n+ \, b n+ \] C 
[a „, b n ]for each n € Z+. Then p)^=i f a n , b n ] is nonempty. 

Proof. Assume that [a n+ \, b n+ \] C [a n , b n ] for each n e Z+, as illus¬ 
trated in Figure 7.8. 

It follows that the interval endpoints satisfy the inequalities, 

a\ < a 2 < ... < a n < ... < b„ < ... < b 2 < bu 

The set {a n ) ne z + is bounded from above (for example, by each b n ) and 
therefore has a least upper bound A. Similarly, the set {b n }neZ+ has a 
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FIGURE 7.8: The interval [a n+ \,b n+ \] is a subset of [a n , b n ] for each n. 

greatest lower bound B. Note that A < B and therefore the interval 
[A, Z?] is nonempty. 

We claim that f)%L\[a n , b n ] = [A, 5], to complete the proof of 
the lemma. First, we show that {X? = \[a n ,b„] C [A, B], Thus, let 
x € b n ] be arbitrary. Then x e [a n , b n ] for all n, implying 

that x > a n and x < b n for all n. Therefore x > A and x < B; that is, 
X e [A, B]. Hence (XLd<*n, b H ] C [A, B ]. 

To prove that [A, B] c Hn^=i [ Q n,b n ], let x € [A, B] be arbitrary. 
Then x > a n and x < b n for all n. Therefore x e i a n, b„], Thus 

[A, B ] c Dnli[ a n> Land it follows that n^i fan. b n ] = [A, B]. ■ 

Lemma 7.12 does not hold if we replace the closed intervals with open 
intervals. For instance, the collection of nonempty bounded open intervals 
{(0, £) | n € Z+} satisfies the condition that (0, ^y) C (0, £) for each 

n € Z+, but rr=l (°> = 0 - 

THEOREM 7.13. Every closed and bounded interval \a, b] is a compact 
subset ofR with the standard topology. 

Proof. Let O be a cover of [a, b] by open sets in R. To derive a contra¬ 
diction, assume that there is no finite subcollection of O covering [a, b]. 

Consider the intervals \a, and [2^2, b] obtained by dividing 
[a, b] in half. The collection O covers both of these intervals. For at 
least one of the two, there is no finite subcollection of O that covers it 
(otherwise there would be a finite subcollection of O covering [a, b], 
contrary to our assumption). Choose such a half of [a, b ], and denote it 
by [a\,b\\. 

In a similar manner, we can choose a half of [a\, b\] that is not 
covered by a finite subcollection of O and denote it by [a %, bf]. 

We repeat this process. In other words, given [ a n , b n ], a subset of 
[a, b] that is not covered by a finite subcollection of O, choose a half of 
[an, b n \ that is not covered by a finite subcollection of O, and denote it 
by [a„ +l ,b„+i]. (See Figure 7.9.) 

a n +1 b n+ 1 

[ -E-J-]-3 

a a n b n b 

FIGURE 7.9: Each interval [a n , b n ] is not covered by a finite subcollection of O. 
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Consider the collection of intervals [[a n , b„]} ne z+- Based on the 
construction of these intervals, the following statements hold for each 
n € Z+: 

(1) [< 2 / 1 + 1 * b n +\] C [a n ,b, j], 

(ii) b n - a n = ^r, 

(iii) [a„, b n ] is not covered by a finite subcollection of O. 

By Lemma 7.12, it follows that [<?«> b„] is nonempty. Let x be 
in this intersection. Then x e [a. b], and therefore there exists U e O 
such that x € U. Since U is open in R, there exists s > 0 such that 
(x — e, x + e) C U. Let N be a positive integer large enough so that 
< £■ Since x e b„], it follows that x e [aN,b}j]. 

Furthermore, since b^ — on = < £, it follows that [«w, bw] c 

(x — s, x + e) c U. But then [ a n, b n] is covered by a single set in O, 
contradicting the fact that [a^ , b n 3 is not covered by a finite subcollection 
of O. 

Thus there must be a finite subcollection of O that covers [a, b\, and 
it follows that [a, b] is compact. ■ 


EXAMPLE 7.6. Recall from Example 3.17 that a topological graph is a space 
obtained by taking finitely many points (vertices) and finitely many closed 
bounded intervals and gluing the endpoints of the intervals to the vertices. A 
collection of finitely many points and finitely many closed bounded intervals 
is a compact space since it is a finite union of compact spaces. Thus every 
topological graph is compact since it is the image of a compact space under a 
quotient mapping. 


In the next theorem, we extend Theorem 7.13 to a corresponding result for 
products of closed bounded intervals in R". The proof follows directly from 
Theorem 7.13 and Corollary 7.11. 


THEOREM 7.14. Let [a\,b\\,... , [a n ,b n ] be closed bounded intervals in 
R. Then [ai, b\\ x ... x [a n , b n ] is a compact subset of R". (See Figure 7.10.) 


A 




FIGURE 7.10: A product of closed bounded intervals is a compact subset of R' ! . 
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Now, using Theorem 7.14 we can completely determine the compact sub¬ 
sets of R n in the standard topology. Recall that in a metric space ( X , d) a set 
A is said to be bounded if there exists m > 0 such that d(x,y) < m for all 
x, y € A. 


THEOREM 7.15. Let R' ! have the standard topology and the standard metric 
d. Aset A C R" is compact in R" if and only if it is closed and bounded. 


Proof. Let A be compact in R" . Then since R" is Hausdorff, it follows 
by Theorem 7.8 that A is closed. To see that A is bounded, consider the 
collection O = {B(0, n) \n e Z+}, made up of open balls centered at 
the origin in R". The collection O is an open cover of A, and since A 
is compact, it follows that finitely many of the sets in O cover A. Thus, 
there exists N e Z + such that A c B(0, N). Therefore, for x, y e A, 
we have d(x. y) < 2 N, implying that A is bounded. 

Now assume that A is closed and bounded. Let a = (a\ . a n ) be 

a point in A, and assume that d(x, y) < M for all x and y in A. Then A 
is contained in the product of intervals 


P = [a\ — M,a\ + M]x ... x[a n — M, a n -f M], 


as illustrated for the two-dimensional case in Figure 7.11. The set Pisa 
compact subset of R" by Theorem 7.14. Since A is closed and a subset 
of P, it follows by Theorem 7.7 that A is compact in R”. ■ 


a2 + M 


02 ~ M 
a l 

FIGURE 7.11: The set A is a subset of the compact set [a\ - M,a\ + M] x [«2 - 
M , 02 + M], 



For subsets of R" in the standard metric and topology. Theorem 7.15 im¬ 
plies that defining compact as closed and bounded is equivalent to defining it 
in terms of open coverings. 

Now, again considering Figure 7.1, we see that the sets in the plane that 
are labeled “compact” are exactly those sets that are closed and bounded, and 
the sets that are labeled “noncompact” are the sets that are either not closed or 
not bounded. 
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EXAMPLE 7.7. The circle S 1 is compact since it is a closed and bounded 
subset of R 2 . Recall that a knot is an embedding of S 1 in R 3 . Since S 1 is 
compact, the image of such an embedding is compact in M 3 . We usually think 
of a knot as the image of an embedding in R 3 , rather than the embedding itself. 
Thus, we say that a knot is a compact subset of R 3 . 


EXAMPLE 7.8. The torus T is compact. We prove this in three ways. 

Method 1: The torus T is the subspace of R 3 obtained by rotating a circle 
about the z-axis (as described in Example 3.5). The torus is closed since given 
any point in its complement, there is an open ball of sufficiently small radius 
centered at that point and disjoint from the torus. Also, the torus is bounded 
since it is contained in the ball of radius 4 centered at the origin. Since the 
torus is closed and bounded in R 3 , it is compact. 

Method 2: The torus is homeomorphic to the product space S 1 x S 1 . The 
circle S 1 is compact. Therefore the torus is the product of two compact spaces, 
and thus it is compact. 

Method 3: The torus is homeomorphic to a space obtained as a quotient 
space of the square by identifying opposite edges. The square is a closed and 
bounded subset of R 2 and therefore is compact. Furthermore, a quotient map 
on the square is a continuous function, and the image of a compact space under 
a continuous function is compact. Therefore the torus is compact. 


The second method in Example 7.8 also allows us to conclude that for 
each n > 2, the n-torus is a compact topological space. Furthermore, the third 
method allows us to conclude that the annulus, Mobius band, Klein bottle, 
sphere, and projective plane are all compact spaces as well. 

Now, if in Theorem 7.15 we replace R" with an arbitrary metric space, is 
being compact still equivalent to being closed and bounded in this, more gen¬ 
eral, setting? The answer is no. One implication in the equivalence continues 
to work; that is, in a metric space, a compact subset is closed and bounded. 
However, a closed and bounded subset of a metric space need not be compact. 
We ask you to prove these results in Exercise 7.19. 

One of the properties that mathematicians were trying to capture in the 
concept of compactness is that sequences have a convergent subsequence. In 
the next theorem we show that this holds for compact sets in a metric space. 


THEOREM 7.16. Let(X, d) be a metric space, and assume that A is compact 
in X. If (x n ) is a sequence in A, then there exists a subsequence (x nm ) of (x n ) 
that converges to a limit in A. 
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Proof. We claim that there exists a e A such that every nonempty open 
ball centered at a contains x n for infinitely many n. Suppose this does not 
hold. Then for every a e A there exists s a such that s a ) contains 

members of the sequence (x n ) for at most finitely many n. Consider 
the collection O = {Bd(a, e a )} a eA- No finite subcollection of O can 
cover the sequence ( x n ) since each set Bd(a, e a ) contains members of 
the sequence (x n ) for only at most finitely many n. On the other hand, 
O is an open cover of the compact set A and therefore there is a finite 
subcollection of O that covers A. Since the sequence (x„) is in A, this is 
a contradiction. Thus, there exists a e A such that every nonempty open 
ball centered at a contains x n for infinitely many n. 

Now, for each me Z + let x„ m be a point in the sequence (x n ) such 
that n m > n m -\ and x„ m e Bd(a, ^). Then (x „ m ) is a subsequence of 
(x„) converging to a e A. ■ 


EXAMPLE 7.9. Consider the sequence in [0,1] given by 
fil 1 i 1 5 7 i 1 ) 

The sequence bounces around in [0,1], and no term in the sequence repeats. 
However, since [0,1] is compact. Theorem 7.16 guarantees that there is a 
subsequence that converges to some point a e [0,1], Interestingly, in this case 
we can do better. In fact, for every a € [0,1] there is a subsequence converging 
to a. (See Exercise 7.20.) 


An important convergence question for metric spaces is the following: If 
the points in a sequence ( x n ) get closer and closer to each other, then does 
the sequence converge? We investigate this question in the remainder of the 
section. 

DEFINITION 7.17. Let ( X , d) be a metric space. A sequence (x n ) in X is 
called a Cauchy sequence if for every e > 0 there exists N e Z+ such that 
d(x n , x m ) < sfor every n,m > N. 

The defining condition for a Cauchy sequence indicates that the points 
in the sequence (x n ) are getting close to each other as n gets large, but it 
does not say anything about whether or not the sequence converges. For R", 
the following theorem settles the question about the convergence of Cauchy 
sequences: 

THEOREM 7.18. Let (x n ) be a Cauchy sequence in K” with the standard 
metric d. Then (x n ) converges to a limit in R”. 
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Proof. Suppose that (x„) is a Cauchy sequence in R”. Pick a fi > 
0. Since ( x „) is a Cauchy sequence, there exists N e Z+ such that 
d(x n ,x m ) < p, for all n, m > N. Let C be the closed ball of radius /x 
centered at xn- Then C is a compact set since it is closed and bounded. 
Also, the sequence (xn, xn+ i, ...) is in C. By Theorem 7.16, the se¬ 
quence (xn, xn+\, ...) has a subsequence that converges to a limit jc* 
in C. We claim that the original sequence ( x „) converges to jc*. 

Thus, suppose that e > 0 is arbitrary. Because ( x n ) is a Cauchy 
sequence, we can choose M e Z+ such that d(x n , x m ) < e/2 for every 
n,m > M. Now, let k > M be arbitrary. We show that d(xk, x*) < e, 
proving that (jc„) converges to x*. Since (xn, jcat+i ,.. •) has a subse¬ 
quence converging to jc*, there exists j > M such that d(xj, jc*) < e/2. 
Also, having j,k > M implies that d( jc*, jc j) < e/2. Therefore, by the 
triangle inequality, it follows that d(xk, jc*) < e, as desired. ■ 


DEFINITION 7.19. A metric space X is called complete if every Cauchy 
sequence in X converges to a limit in X. 


By Theorem 7.18, it follows that R" with the standard metric is a complete 
metric space. However, if we let X = R — {0} with the metric d(x,y) = 
|jc — y|, then X is not a complete metric space. The sequence (1, 5, 5, 5 , ■ ■ •) 
is a Cauchy sequence in X that does not converge in X. 

The following theorem is another useful convergence result for compact 
metric spaces. 


THEOREM 7.20. IfX is a compact metric space, then X is complete. 

Proof. See Exercise 7.21. ■ 

Exercises for Section 7.2 

7.14. Does Lemma 7.12 hold if we replace nonempty closed bounded intervals in R 
with nonempty closed sets in R? Specifically, prove the following implication 
or find a counterexample: If {A„}„ e z + is a collection of nonempty closed sets 
in R such that A „+1 c A n for each n e Z + , then HneZ + ' s nonempty. 

7.15. Show that [0,1] is not a compact subset of R with the lower limit topology. 

7.16. If Rrf denotes R with the discrete topology, determine which of the subsets, 
A = {(jc, 2) | 1 < x < 2} and B = {(3, y) | 1 < y < 2}, as shown in 
Figure 7.12, is a compact subset of R^ x R. 

7.17. Use compactness to prove that the plane is not homeomorphic to the sphere. 
(Recall, in Section 6.2 we distinguished between a number of pairs of spaces, 
including the line and the plane and the line and the sphere, but we indicated that 
we were not yet in a position to distinguish between the plane and the sphere. 
With compactness, we can now make that distinction.) 
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FIGURE 7.12: Is either A or B a compact subset of x R? 


7.18. In this exercise we demonstrate that if we drop the condition that X is Hausdorff 
in Theorem 7.6, then the intersection of compact sets in X is not necessarily 
a compact set. Define the extra-point line as follows. Let X = R U {p e }, 
where p e is an extra point, not contained in R. Let B be the collection of 
subsets of X consisting of all intervals ( a, b ) c R and all sets of the form 
(c, 0) U [p e ] U (0. d) for c < 0 and d > 0. 

(a) Prove that B is a basis for a topology on X. 

(b) Show that the resulting topology on X is not Hausdorff. 

(c) Find two compact subsets of X whose intersection is not compact. Prove 
that the sets are compact and that the intersection is not. 

7.19. (a) Let (X, d) be a metric space. Prove that if A is compact in X, then A is 

closed in X and bounded under the metric d. 

(b) Provide an example demonstrating that a subset of a metric space can be 
closed and bounded but not compact. 

7.20. Consider the sequence defined in Example 7.9. Prove that for every a e [0,1 ] 
there is a subsequence converging to a. 

7.21. Prove Theorem 7.20: If X is a compact metric space, then X is complete. 


7.3 The Extreme Value Theorem 

The Extreme Value Theorem, like the Intermediate Value Theorem, is a topo¬ 
logically based theorem that is often introduced in a calculus course. As we saw 
in Chapter 6, the Intermediate Value Theorem concerns continuous real-valued 
functions on a connected domain. The Extreme Value Theorem concerns con¬ 
tinuous real-valued functions on a compact domain. In this section we prove 
a general version of the Extreme Value Theorem and consider a number of its 
useful consequences. 

To begin, we need the following lemma, which indicate that every compact 
subset of the real line contains a maximum value and a minimum value. (See 
Figure 7.13.) 
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FIGURE 7.13: Every compact subset of E has maximum and minimum values. 

LEMMA 7,21. Let Abe a compact subset ofR. Then there exist m, M e A 
such that m < a < M for all a e A. 

Proof. Here we prove the existence of the maximum value M. The proof 
of the existence of the minimum value m is similar. Since A is compact, 
it is closed and bounded in R. Therefore A is bounded from above. It 
follows that the set A has a least upper bound; denote it by M. Of course, 
a < M for all a € A. We claim that M € A. We prove the claim by 
contradiction; thus suppose that M g A. Since A is closed, it follows 
that there exists an e > 0 such that (M — e, M + e) (T A = 0. Then 
M — | is an upper bound for A that is smaller than M, a contradiction. 
Therefore M e A, and A has a maximum value. ■ 


Now, using Lemma 7.21, we establish the Extreme Value Theorem. 

THEOREM 7.22. The Extreme Value Theorem (General Version). 

Let X be compact and f : X —► R be continuous. Then f takes on a maximum 
value and a minimum value on X; that is, there exist a, b e X such that 
f{a) < fix) < f{b) for all x € X. 

Proof. Since X is compact and / is continuous, fiX) is a compact 
subset of R by Theorem 7.5. Therefore fiX) contains a maximum 
value M and a minimum value m by Lemma 7.21. The points m and 
M are in /(X); therefore there exist a,b e X such that /(a) = m and 
fib) — M. Now, for all x e X we have that fix) e fiX), and thus 
f(a) — m < fix) < M — fib), as we wished to show. ■ 


EXAMPLE 7.10. In this simple application of the Extreme Value Theorem, 
we view the surface of the Earth as a sphere and the surface temperature as a 
continuous function on the sphere. Since the sphere is compact, the Extreme 
Value Theorem ensures that somewhere on the Earth there is a point or there 
are points where the temperature is hotter than those at every other point on 
Earth, and there are also points where the temperature is colder than those at 
every other point on Earth. 


The following corollary of Theorem 7.22 is the version of the Extreme 
Value Theorem that is usually encountered in a calculus course. It follows 
from Theorem 7.22 if we let X = [ a, b ]. 

COROLLARY 7.23. The Extreme Value Theorem on [a, b]. Assume 
that f : [a, b] —» R is continuous. Then f takes on a maximum value and a 
minimum value on [a, b]. 
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Combining the Extreme Value Theorem and the Intermediate Value The¬ 
orem, we obtain the following corollary: 

COROLLARY 7.24. Let [a, b] be a closed and bounded interval in M, and 
assume that f : [a, b] —>■ R is continuous. Then the image of f is a closed 
and bounded interval in M. (See Figure 7.14.) 

Proof. See Exercise 7.22. ■ 



FIGURE 7.14: For a continuous / : [a, b ] -» R, the image /([a, fc]) is a closed and 
bounded interval in R. 


The Extreme Value Theorem is the basis for a variety of optimization 
theorems and applications. Consider the following example: 


EXAMPLE 7.11. The publishers at Klein’s Publishing House would like to 
catch the new wave of interest in applied topology. They are planning to pub¬ 
lish an applied topology text to compete with those currently on the market. 
The business department has committed a $500,000 budget to the first run of 
the book. The budget is to be allocated among several aspects of the book’s 
production, including an authorship contract, editorial costs, printing, adver¬ 
tising, and distribution. Suppose that there are n such variables, vi,... , v„. 
The profit that the publishers can expect to make on this venture depends on 
how the resources are allocated. For example, if they opt for printing the book 
in a high-cost Mobius-band format and a minimal advertising budget, they will 
realize a smaller profit than if they had used a standard book format and a larger 
advertising budget. Thus, we regard the profit, P, as a function of the variables 
vi, ... ,v n . It is natural to assume that P is continuous. The domain of P is 
the subset of K” given by 

D = {(vi,... , v„) | vi > 0,... , v n > 0, ui + ... + v„ < 500,000}. 

Since D is closed and bounded, it is compact. Therefore the Extreme Valde 
Theorem indicates that there is a choice for the allocation of resources resulting 
in a maximum profit for the planned textbook project. 


j 
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While the Extreme Value Theorem does not tell us how to find the maxi¬ 
mum and minimum values of a function, it guarantees that they exist. This is 
typical of this kind of topology theorem. Like the Intermediate Value Theo¬ 
rem, the Extreme Value Theorem asserts the existence of a point with specified 
properties, but does not provide an exact location for it. 

The next two theorems will be helpful to us in establishing subsequent 
results in the text. The Extreme Value Theorem is used in the proof of each. 

Recall that if A and B are subsets of a metric space (X, d), then we define 
the distance between A and B by 

d(A, B ) = glb{d(a, b) \ a € A, b e B). 

The following theorem indicates that there is a positive distance between two 
disjoint compact sets in a metric space. 

THEOREM 7.25. Let (X, d) be a metric space. If A and B are disjoint 
compact subsets of X, then d(A , B) > 0. 

Proof. The distance function d : X x X -*■ R is continuous (see Exer¬ 
cise 5.13), and the set A x B is compact in X x X by Corollary 7.11. 
Therefore the Extreme Value Theorem applies and indicates that d takes 
on a minimum value on A x B. That is, there exist a* € A and b* e B 
such that d(a*, b*) < d(a, b) for every a € A and b e B. It follows 
that d(A, B) = d(a*, b*). Since A and B are disjoint, a* ^ b*, and 
therefore d(a*, b*) > 0. Hence, d(A, B) > 0, as desired. ■ 


IMPORTANT NOTE: The result of Theorem 7.25 does not hold if we replace 
“compact” with “closed.” That is, in a metric space it is possible to have 
disjoint closed sets A and B with d(A, B) =0. (See Exercise 7.23.) 

In the next results, we work with functions defined by taking the distance 
between a point and a set in a metric space. Specifically, let X be a metric 
space and let A be a subset of X. Define /a : X R by /a(x) = d({x]. A). 
Thus, /a(x) is the distance from the single-point set {x} to the set A. We can 
regard it as the distance from the point x to the set A. 

LEMMA 7.26. Let (X, d) be a metric space, and let Abe a subset of X. The 
function f& : X -* R, defined by /a(x) = d((x}, A), is continuous. 

Proof. See Exercise 7.24. ■ 

The following lemma tells us that, given an open cover of a compact metric 
space, there is a threshold value such that every open ball with a radius below 
the threshold is guaranteed to lie in some set in the open cover. 

LEMMA 7.27. The Lebesgue Number Lemma. Let (X, d) be a com¬ 
pact metric space, and let O be an open cover of X. Then there is a X > 0 
such that for every x e X there exists aU € O satisfying Bj (x, A) C U. 

The number X is called a Lebesgue number for the cover O. 
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Proof. Let O be an open cover of X. If the whole space X is in the cover, 
then any real number X serves as a Lebesgue number for the cover. Thus 
assume X is not one of the sets in the cover. 

Since X is compact, finitely many of the sets from O cover X, say 
U\,... , U„. For each i = 1,... , n, set C,- = X — Each set C, is 
nonempty since we are assuming that none of the sets in O is equal to X. 
Define f : X R by 


/(•*) = ~ i^/c, 0)- 

n t— 1 

i=i 

Thus, /( x) is the average of the distances from x to each set C,-. 

We claim that f(x) > 0 for all x e X. To prove the claim, let 
x € X be arbitrary. Since the sets t/, cover X, there exists k such that 
x e Uk- The set 14 is open; consequently there exists e > 0 such that 
B c i(x , e) € Uk- It follows that fc k (x) > e, and therefore f(x) > e/n. 

The function / is continuous since each /c, is continuous by Lemma 
7.26 and since addition and multiplication of continuous functions is con¬ 
tinuous. (See Exercise 4.16.) Furthermore, the domain of / is compact. 
Therefore the Extreme Value Theorem implies that / takes on a minimum 
value X. Since f(x) > 0 for all x, it must be that X > 0. 

We show that X is the desired Lebesgue number. Thus, let x e X 
be arbitrary, and consider Bd(x, X). We claim that this open ball is a 
subset of at least one of the sets Uj and therefore is a subset of some set 
from the cover O. Suppose none of the sets U-, contains Bd(x, X). Then 
fcfx ) < X for all i, implying that f(x) < X. But this contradicts the 
fact that X is the absolute minimum of / over X. It follows that X satisfies 
the requirements of the lemma. ■ 

We will find the following corollary of Lemma 7.27 helpful in some of the 
work we do in the remainder of the text. 

COROLLARY 7.28. Let O be a cover of the closed bounded interval [a, b ] 
by sets that are open in R. Then there is a subdivision 
a = ao < a\ < ... < a n =b 

of [a. b] such that for all j = 1 there exists a Uj € O containing 
l a j -11 a j]. 

Proof. See Exercise 7.26. B 

The Fundamental Theorem of Calculus is another theorem whose proof 
uses the Extreme Value Theorem. Specifically, the Extreme Value Theorem is 
used to prove Rolle’s Theorem, which is equivalent to the Mean Value Theorem, 
and the Mean Value Theorem is an important tool employed in proving the 
Fundamental Theorem of Calculus. 

In Chapter 9 we use topological tools to prove the Fundamental Theorem 
of Algebra. We do not go so far as to say that the Fundamental Theorem of 
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Calculus and the Fundamental Theorem of Algebra are theorems of topology. 
What is important to realize, however, is how intertwined the disciplines of 
algebra, analysis (of which calculus is a part), and topology are. Although 
typically they are distinct courses at the introductory level in an abstract math¬ 
ematics program, they are far from distinct in the topics, tools, and properties 
they cover. This should not be a surprise, because these disciplines have all 
arisen from a desire to understand the structure of the real-number system, 
Euclidean space, and functions between them. 

Exercises for Section 7.3 

7.22. Prove Corollary 7.24: Let [a, b ] be a closed and bounded interval in R, and 
assume that / : [a, b] —*■ R is continuous. Then the image of / is a closed and 
bounded interval in R. 

7.23. Provide an example of closed sets, A and B, in a metric space (A, d) such that 
A and B are disjoint and d(A, B) = 0. 

7.24. Prove Lemma 7.26: Let ( X , d) be a metric space, and let A be a subset of X. 
The function /a : X -» R, defined by /a(x) = <7({x), A), is continuous. 

7.25. Provide an example showing that the Lebesgue Number Lemma does not hold 
if we drop the assumption that the space is compact. 

7.26. Prove Corollary 7.28: Let O be a cover of the closed bounded interval [a, b] by 
sets that are open in R. Then there is a subdivision a = a 0 < a\ < ... < a n — b 
of [a, b ] such that for all j = 1 ,... ,n, there exists a UjeO containing 
[aj-i.cij]. 

Supplementary Exercises: The Tietze Extension Theorem 

Let A be a subset of a topological space X, and let / : A -*■ Y be continuous. It is 
often of interest to know if / extends to a continuous function F : X -» T; that is, if 
there exists a continuous F : X —> Y such that F(a) — f{a) for all a € A. 

The Tietze Extension Theorem provides general conditions under which it can be 
concluded that extensions exist. Specifically, if A is a closed subset of a normal space 
X , and J c R is either a closed bounded interval, an open interval, or all of R, then 
the Tietze Extension Theorem asserts that every continuous / : A -*■ J extends to a 
continuous function F : X —> J. 

In these exercises, we work through a proof of the following special case of the Tietze 
Extension Theorem. A proof of the general Tietze Extension Theorem can be found in 
[Mun]. 

THEOREM 7.29. The Tietze Extension Theorem for Metric Spaces. Let 

A be a closed subset of a metric space X; then every continuous f : A —► [—1, 1] 
extends to a continuous function F : X —> [— 1,1], 

Our proof of the Tietze Extension Theorem uses the Uniform Convergence Theorem 
(Theorem 4.13), which was established in a set of supplementary exercises in Sec¬ 
tion 4.1. 

We use the Tietze Extension Theorem to prove a retraction existence theorem (The¬ 
orem 9.14) in Chapter 9. That result is then used in the proof of the Jordan Curve 
Theorem in Chapter 11. 
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Before proving Theorem 7.29, we need a few supporting results. The following lemma 
indicates that if B and C are disjoint nonempty closed subsets of a metric space X, 
then there exists a continuous real-valued function that takes on constant values b and 
c on B and C, respectively, and takes on values between b and c on the complement of 
BUC: 

LEMMA 7.30. Let B and C be disjoint nonempty closed subsets of a metric space X, 
and assume b, c 6 R withb < c. Then there exists a continuous function g : X —*■ [b, c] 
such that B = g~' (b) and C = g~ ] (c). 

SE7.27. As in Lemma 7.26, let fg(x) and fc(x) be the distances from x e X to 
the sets B and C, respectively. Show that the requirements of Lemma 7.30 are 
satisfied by the function g : X —> [b, c], defined by 

, _ cfB(x) + bfc(x) 

8 f B {x) + fc(x)' 


With Lemma 7.30 we can prove the following lemma, which we then use in the proof 
of the Tietze Extension Theorem: 

LEMMA 7.31. Let X be a metric space, and let A C X be closed. Assume that 
f : A —»■ [—k, k\ is continuous. Then there exists a continuous g : X —► [—|, |] such 
that \f(a) — g(a)| < y for alia 6 A. 


The idea in proving Lemma 7.31 is to have g(x) equal — | for those x for which 
fix) < — to have g(x) equal | for those x for which f(x') > |, and to otherwise 
have g continuously transition between —| and |. (See Figure 7.15.) Specifically, 
let B = f~ l ([—k, —|]) and C = /“’([§.£])• The sets B and C are disjoint closed 
subsets of A, and since A is closed in X, they are also closed in X. By Lemma 7.30, 
there exists a continuous function g : X -*■ [— |, |] such that g -1 (— |) = B and 

= c. 



FIGURE 7.15: The function g approximates / and maps into [—|, |], 
SE 7.28. Prove that | f(a) — g(a)| < y for all a e A. 


Section 7.4 Limit Point Compactness 245 


Proof of the Tietze Extension Theorem for Metric Spaces. Let A be a closed 
subset of a metric space X, and let f : A-*■ [— 1,1] be continuous. To prove the 
theorem, we build a sequence of continuous functions F„ on X that, on A, are 
progressively better approximations to /. In the limit, we obtain a continuous 
function FonX that is equal to / on A. 

By Lemma 7.31, there exists a g\ : X [-^, j] such that |/(x)-gi(x)| < 
| for all x G A. Now, the continuous function / — g i maps A into the interval 
[— |, §]• Using Lemma 7.31 again, there exists a g 2 : X --*■ [—|, i] such that 
|/(x) — gi (x) — g 2 (x)\ < |. Continuing this process, we obtain a sequence of 
continuous functions 


2 * 1—1 2 n ~ ] 


such that \f(a) — gi(a) — ... — g„(a)| < (|)"foralla G A. 

SE 7.29. Prove that |gi (x) + ... + g n (x)\ < 1 for all x g X. 

For each n e Z+, define F n : X -*■ [—1, 1] by F„(x) = g\ (x) + ... + g n (x). 
Each function F n is continuous, being a sum of continuous functions. 

SE 7.30. Prove that for each x G X the sequence of function values (F n (x)) is 
a Cauchy sequence in [— 1, 1]. 

Since (F„(x)) is a Cauchy sequence in [—1,1] for each x g X, Theorem 7.20 
implies that the sequence ( F„ {x )) converges to a limit F{x) g [—1,1]. In this 
way, a function F : X -*■ [— 1,1] is defined. We claim that F is the desired 
extension of /. First we show that F is continuous. 

SE 7.31. Prove that the sequence of functions F n converges uniformly to the 
function F. 

By the Uniform Convergence Theorem (Theorem 4.13), it follows that F is 
continuous. 

SE 7.32. Prove that F is an extension of /; that is, show that F(a) = f (a) for 
every a e A. 

Consequently, F is the desired continuous extension of /, completing the proof 
of the Tietze Extension Theorem. ■ 


7.4 Limit Point Compactness 

Before mathematicians settled on the definition of compactness we presented 
in Section 7.1, other definitions were considered. For example, there was the 
following candidate: 

DEFINITION 7.32. A topological space X is limit point compact if every 
infinite subset of X has a limit point. 
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EXAMPLE 7.12. In R, the subspace A = {0} U | n e Z+} is limit point 
compact. Let B be an infinite subset of A. Then B must contain values of 
the form 1 /n where n is arbitrarily large. Therefore 0 is a limit point of B. 
Thus, every infinite subset of A has a limit point, implying that A is limit point 
compact. 


EXAMPLE 7.13. Let X be an infinite set with the finite complement topology. 
We show that X is limit point compact. Thus, let B be an infinite subset of 
X. We claim that every point of X is a limit point of B. If x e X, and U is a 
neighborhood of x, then since U contains all but finitely many points of X, it 
intersects B in infinitely many points. In particular, U intersects B in points 
other than x, implying that x is a limit point of B. Therefore every infinite 
subset B of X has a limit point, implying that X is limit point compact. 


Examples 7.3 and 7.12 together establish that in R, the subspace A = 
{0} U | n € Z + } is both compact and limit point compact. The following 
theorem indicates that this is no coincidence: 

THEOREM 7.33. If a topological space X is compact, then it is limit point 
compact. 

Proof. We prove that if X is not limit point compact, then X is not 
compact. Suppose that X is not limit point compact. Thus, X has an 
infinite subset B that does not have a limit point. Since each x e B 
is not a limit point of B, it follows that for every x e B there exists a 
neighborhood U x of x such that U x n B = {x}. Also, since B has no 
limit points, Corollary 2.9 implies that B is closed. Hence, X — B is 
open. Let O be the collection of open sets {U x \ x e B} U [X — B}. 
Then O is an open cover of X. Furthermore O has no finite subcover, 
since each of the infinitely many points x in B is contained in only the 
open set U x in O. Therefore X has an open cover with no finite subcover. 
It follows that X is not compact. ■ 


EXAMPLE 7.14. The disk is compact and therefore limit point compact. 
This implies that every infinite subset has a limit point. However, there are 
infinite subsets that do not have obvious limit points. For example, take the set 
AcM 2 , expressed in (r, 0) polar coordinates as 

and illustrated in Figure 7.16. This set of points spirals out toward the circle S 1 , 
but there is no obvious point where this set accumulates. Even so. Theorem 7.33 
tells us that this set must have at least one limit point. 
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FIGURE 7.16: There must be a limit point of A in the disk. 


j 


The converse of Theorem 7.33 does not hold. As the following example 
demonstrates, limit point compactness does not necessarily imply compact¬ 
ness. 


EXAMPLE 7.15. Let Z have the topology generated by the basis 

B = {(— n, n) \n e Z + }. 

In this topology, Z is not compact since the basis B is an open cover that has 
no finite subcover. We claim, however, that Z is limit point compact in this 
topology. In fact, we show that every nonempty subset of Z has a limit point. 
Thus, let A be a nonempty subset of Z. Every subset of Z has an element 
with minimum absolute value; let s be such an element of A. For every t e Z 
such that |t| > |s|, it follows that every open set containing t must contain s. 
Therefore every t satisfying |f | > |s| is a limit point of A. Hence, A has a limit 
point, implying that Z is limit point compact in the topology generated by B. 


If we restrict ourselves to metric spaces, then limit point compactness and 
compactness are equivalent. This is yet one more example of the convenient 
properties that hold in a metric space. We present this result in Theorem 7.36, 
but before doing so we prove a couple of helpful lemmas. 

LEMMA 7.34. Let (X , cl) be a metric space. If A C X and c is a limit point 
of A, then for every e > 0 the open ball Bj (c, s) intersects A in infinitely many 
points. 

Proof. Let c be a limit point of A. We establish the desired result by 
contradiction. Thus suppose that there is an open ball #</(c, £*) that 
intersects A in at most finitely many points, say a \,... , a n . There is at 
least one point in this list that is not equal to c since c is a limit point of 
A. Therefore 


p = min{d(c, af) | i — 1 ,... , n and a,- ^ c } 
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is defined and positive. It follows that Bd(c, contains no points of 
A except possibly c. But this contradicts c being a limit point of A. 
Therefore for every e > 0, the open ball Bd(c,s) intersects A in infinitely 
many points. ■ 

The second lemma that we need is a Lebesgue Number Lemma for limit 
point compact metric spaces. We are in a somewhat peculiar situation here. We 
have Lemma 7.27, the Lebesgue Number Lemma for compact metric spaces, 
and compactness is equivalent to limit point compactness for metric spaces, so 
does that give us a Lebesgue Number Lemma for limit point compact metric 
spaces? The answer is no. We have not yet shown that compactness and limit 
point compactness are equivalent for metric spaces; in fact, that is what we 
are trying to prove here. Thus, we need to go through the work of proving a 
Lebesgue Number Lemma for limit point compact spaces, even though it is 
ultimately equivalent to the Lebesgue Number Lemma that we already proved. 


LEMMA 7.35. Let (X, d) be a limit point compact metric space, and let O 
be an open cover of X. Then there is a X > 0 such that for every x e X there 
exists a U € O satisfying Bd(x, k) c U. 

Proof. Let (X, d ) be a limit point compact metric space, and assume 
that O is an open cover of X. Since O covers X, it follows that for each 
x e X there exists a U x € O such that x g U x . Let e x > 0 be such that 
Bd( x i £jc) c U x . 

To derive a contradiction, assume that no k exists as specified in the 
statement of the lemma. Then for each n e Z + , we can choose an x n e X 
such that Bd(x n , £) is contained in no set U e O. 

There may be repetition among the points x n , but we claim that the set 
Y = {x„} ne z + contains infinitely many different points. We prove this 
claim by showing that there is no p e X such that p = x„ for infinitely 
many n. To derive a contradiction, assume that there is such a point p. 
Then since p = x n for infinitely many n, we can choose Me Z+ such 
that p = xm and jj < e p . Now, by definition of s p there exists a U p e O 
such that Bd(p, e p ) c U p . Furthermore, Bd(p, jf) C Bd(p, s p ) since 
j-j < e p . Therefore U p is a set in O such that Bd(p, jj) C U p . But 
since p = xm, this contradicts the definition of xm- It follows that there 
is no p e X such that p = x n for infinitely many n, and therefore Y is 
an infinite set. 

Since X is limit point compact, Y has a limit point y. Now Bd(y, s y ) 
is contained in some set O y € O. By Lemma 7.34, since y is a limit 
point of Y, infinitely many of the x„’s lie in Bd(y, ^). Choose N e Z + 
such that xn e Bd(y, y) and y < y. It follows that Bd(xN, jj) C 
Bdiy,Sy) C O y , contradicting the definition of xn- Thus there exists a 
k as specified in the statement of the lemma. ■ 
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Now, we are ready to prove the desired theorem. 

THEOREM 7.36. //(X, d) is a metric space, then X is compact if and only 
if it is limit point compact. 

Proof. Having already established Theorem 7.33, we only need to prove 
that if X is a limit point compact metric space, then X is compact. Thus, 
assume that X is a limit point compact metric space with metric d, and 
let O be an open cover of X. We show that O has a finite subcover. 

By Lemma 7.35 there exists a Lebesgue number A for O. Consider 
the cover of X given by 

C = {B d (x, X) | x g X}. 

We claim that C has a finite subcover. We use such a subcover of C 
to help us obtain a finite subcover of O. 

To prove that C has a finite subcover, we assume that it does not and 
derive a contradiction. Thus assume that no finite subcollection of C 
covers X. Pick jci e X. Then B d (x\ , X) does not cover X, and therefore 
we can pick xi e X — B d (xi, X). Now, [B d (x i, X), B d (x 2 , A.)} does 
not cover X either, so we can pick xj, e X — {B d {x\ , X) U B d (X 2 , A)). 
Continuing this process, we define a set Y — {x n ] ne x+ C X that is such 
that xk Uj=l B d (Xj , A) for all k e Z+. (See Figure 7.17.) 



X 


FIGURE 7.17: The point xk does not lie in any of the balls B d (xi, A) for i < k. 

The points in Y are distinct. Therefore Y is an infinite set. Let y be a 
limit point of Y. Then by Lemma 7.34, B d (y, |) contains infinitely many 
points from Y. Let x p and x q be two such points and assume that p < q. 
Since x p and x q both lie in B d (y, |), it follows that d(x p , x q ) < A. But 
this contradicts the fact that x q $ (Jy=i Bdfrj, A). Therefore we may 
infer that C has a finite subcover. 

Let [B \,... , B m } be a subcover of C. Each B t is an open ball of 
radius A. Since A is a Lebesgue number for the cover O, it follows that 
for each i = 1,... , m, there exists an O,- € O such that B\ C 0\. The 
collection [0 1 ,... , O m \ is then a finite subcover of O. Therefore X is 
compact. B 
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Exercises for Section 7.4 

7.33. Let Rj denote R with the discrete topology. Using the definition of limit point 
compactness directly, determine which of the subsets A — {(x, 2) | 1 < x < 2} 
and B = {(3, y) \ 1 < y < 2}, as previously shown in Figure 7.12, is a limit 
point compact subset of R<* x R. 

7.34. Let Y = {1,2} with the trivial topology and Z + be the positive integers with the 
discrete topology. Prove that the product space Y x Z+ is limit point compact 
but not compact. 

7.35. Show that [0, 1] is not limit point compact as a subspace of R with the lower 
limit topology. 

7.36. Let X be a limit point compact space and A be a closed subset of X. Prove that 
A is limit point compact in the subspace topology. 


7.5 One-Point Compactifications 

As we have already seen, compact spaces and sets have a number of useful 
features. For example 

(i) Compact sets are closed and bounded in a metric space, 

(ii) Sequences have convergent subsequences in a compact subset of a 
metric space, 

(iii) Compact metric spaces are complete, and 

(iv) Continuous functions on compact spaces attain minimum and max¬ 
imum values. 

Furthermore, we have also encountered the useful properties possessed by 
a Hausdorff space. For example 

(i) Single-point sets are closed in a Hausdorff space, and 

(ii) Convergent sequences converge to a unique limit in a Hausdorff 
space. 

Unfortunately, we do not always have the advantages afforded by a compact 
and Hausdorff space in the topological spaces we use. In this section, we 
introduce the property of local compactness and a construction, called the one- 
point compactification, that allows us to add a single point to a locally compact 
Hausdorff space X in order to obtain a compact Hausdorff space Y containing 
X as a subspace. In the case where X = R 3 , we will see that the one-point 
compactification yields the 3-sphere S 3 . 


DEFINITION 7.37. A topological space X is locally compact if every x e X 
has a neighborhood that is contained in a compact subset of X. 
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EXA M PLE 7.1 6 . Every compact space is automatically locally compact since 
each x e X has X both as a neighborhood and as a compact set containing the 
neighborhood. 

_i 


EXAMPLE 7.17. The real line R is locally compact since for each x e R we 
have x € (x — 1, x + 1) C [x — 1, x + 1], and [jc — 1, x + 1] is compact. 

_i 

-i 

EXAMPLE 7.18. The subspace Q of R in the standard topology is not locally 
compact. (See Exercise 7.37.) 

_i 


DEFINITION 7.38. Let X be a Hausdorff space. Set Y equal to the union 
ofX and a single additional point, denoted oo. (See Figure 7.18.) Define the 
open sets for a topology onY = X U {oo} to be subsets of the following two 
types: 

(i) Open sets in X, and 

(ii) Sets of the form Y — C, where C is a compact subset of X. 

We call the resulting topological space Y the one-point compactification 
ofX. 



FIGURE 7.18: Open sets U and Y - C in the one-point compactification Y = X u 
{oo}. 

Of course, we need to verify that the collection of open sets just described 
is a topology. We do that next. 

THEOREM 7.39. Let X be a Hausdorff space. The collection of subsets of 
Y = X U {oo} in the definition of the one-point compactification of X is a 
topology on Y. 
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Proof. The empty set is open in Y since it is an open subset of X. The 
entire set Y itself is open in Y since it is the complement of 0 in Y, and 
0 is a compact subset of X. 

To prove that finite intersections of open sets in Y are open in Y, it is 
enough to check intersections of pairs of open sets U and V. The result 
for arbitrary finite intersections then follows by induction. Thus assume 
that U and V are open sets in Y. We need to check three separate cases. 
First, if both U and V are open sets in X, then U H V is an open set in 
X, making it an open set in Y. Second, assume that U = Y — C\ and 

Y = Y — C 2 , where C\ and C 2 are compact subsets of X. Then U n V = 

Y — (Ci U Cf). Since finite unions of compact sets are also compact, 
Ci U C 2 is a compact subset of X. It follows that U DV = Y — Cfora 
compact subset C of X, and therefore U D V is open in Y in this case as 
well. Finally, assume that U is an open set in X and V = Y — C, where 
C is a compact subset of X. Then since 00 is not in U , it follows that 
U fl V = U n (X — C). Now C is closed in X by Theorem 7.8 since 
it is a compact set in the Hausdorff space X. Therefore X — C is open 
in X, implying that U fl (X — C) is open in X. Thus, U D V is open in 
X, making it open in Y in this case, too. It follows that if U and V are 
arbitrary open sets in Y, then U Pi V is also open in Y, as we wished to 
show. 

Last, we prove that arbitrary unions of open sets are open. We can 
express such an arbitrary union in the form 

(U c/ “> u <U< y -W’ 

aeA /?e B 

where each U a is open in X and each Cp is a compact subset of X. The 
set UaeA ‘ s °P en in X; we denote this by U. Furthermore, 

\J(Y-Cp) = Y-f]Cp, 

B P<=B 

and since each set Cp is a compact subset of the Hausdorff space X, 
Theorem 7.6 implies that C^ is a compact subset of X. Letting 
C = PljSsB Cp, wesee thatlj^ggfy — Cp) = Y — C and C is a compact 
subset of X. Therefore we only need to verify that U U (y — C) is open in 

Y where U is open in X and C is a compact subset of X. Let C' — X — U, 
the complement of U in X; then C' is closed in X and therefore 

UU(Y -C) = (X-C')U(Y - C ) 

= (Y — C') U (F — C) 

= y - (C' n c). 


Now C is a compact subset of the Hausdorff space X, so C is closed in 
X. Therefore C 1 fl C is closed in X, and since C’ fl C is a subset of the 
compact set C, it follows that C fl C is a compact subset of X. Thus 
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Y — (C' fl C) is an open set in Y, implying that U U (F — C) is an open 
set in Y. It follows that an arbitrary union of open sets in Y is an open 
set in Y. Thus the collection of subsets of Y described in the definition 
of the one-point compactification is a topology on Y. ■ 

Now, X is a subset of the one-point compactification Y = X U {oo}. 
Therefore X inherits a subspace topology from Y. The next theorem indicates 
that this subspace topology is the same as the original topology, and therefore 
we can view X as a subspace of its one-point compactification. 

THEOREM 7.40. Let X be a Hausdorff space, and let Y = X U {oo} be its 
one-point compactification. Then the subspace topology that X inherits from 
Y is equal to the original topology on X. 

Proof. See Exercise 7.38. ■ 

Next, we justify the term “compactification.” 


THEOREM 7.41. Let Xbea Hausdorff space. Its one-point compactification 
Y = X U {oo} is compact. 

Proof. Let O be an open cover of Y. Define Ox to be the collection of 
subsets of X given by {V fl X \ V e O}. The sets in Ox are open in the 
subspace topology that X inherits from Y ; therefore, by Theorem 7.40, 
they are open sets in X. It follows that Ox is an open cover of X. 

Now, O is an open cover of Y, and therefore there exists a U e O 
such that oo 6 U. It must be that U = Y — C where C is a compact subset 
of X. The collection Ox covers C. Since C is a compact subset of X it 
follows that there is a finite collection {V] fl X,... , V„ fl X) c Ox that 
covers C. Therefore [0,Vi,... , V n ] is a finite subcover of O, implying 
that Y is compact. ■ 

Although we can construct the one-point compactification of any Hausdorff 
space, the result is not necessarily Hausdorff. For example, the one-point 
compactification of the set of rational numbers, Q, as a subspace of IK, is 
not a Hausdorff space. (See Exercise 7.40.) On the other hand, we have the 
following theorem: 


THEOREM 7.42. Let X be a locally compact Hausdorff space. Then Y — 
X U {oo}, the one-point compactification ofX, is Hausdorff. 

Proof. To see that Y is Hausdorff, let x and y be points in Y . In the first 
case, assume both x and y are in X. Since X is Hausdorff, we can find 
disjoint open sets U and V in X that contain x and y, respectively. The 
sets U and V are also open sets in Y, and therefore there exist disjoint 
neighborhoods of x and y in Y. In the second case, suppose that x = oo 
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and y € X. By the local compactness of X, there is a compact set C in X 
that contains a neighborhood U of y. Now, Y — C and U are open sets 
in Y, are disjoint, and contain x and y, respectively. Thus, in this case 
too, there exist disjoint neighborhoods of x and y in Y. It follows that Y 
is Hausdorff. ■ 


EXAMPLE 7.19. The one-point compactification of the plane is homeomor- 
phic to the sphere, S 2 . In Example 4.16 we showed that the 2-sphere with 
the north pole, N, removed is homeomorphic to the plane via stereographic 
projection. The one-point compactification of the plane essentially fills the 
missing point, N, back in. To make this explicit, we define a function / from 
the sphere S 2 to the one-point compactification R 2 U {oo} by taking each point 
in S 2 — {/V| to the corresponding point in the plane by stereographic projec¬ 
tion and by taking N to oo. (See Figure 7.19.) The resulting function / is a 
homeomorphism. 



FIGURE 7.19: The sphere is homeomorphic to the one-point compactification of 
the plane. 

When the plane is considered to be the complex plane C, then the one- 
point compactification C U {oo} is called the Riemann sphere or the extended 
complex plane. In this setting, the addition of the point oo not only plays a 
topological role, but also an algebraic role (for example, g = oo), and the 
resulting space serves as a domain for many functions studied in complex 
analysis. 


EXAMPLE 7.20. The 3-sphere, S 3 , is the set of points in M 4 that are at a 
distance 1 from the origin. It can be difficult to visualize the 3-sphere since we 
do not have natural experience visualizing 4-space. However, by an argument 
similar to the one that appears in the previous example, we can show that the 
3-sphere S 3 is homeomorphic to the one-point compactification of R 3 . 

When mathematicians study knot theory, they always assume that the em¬ 
beddings of knots are in the compact space S 3 rather than the noncompact 
subspace R 3 . By adding just a single point to R 3 they can take advantage of 
the benefits of working within a compact setting. 
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Exercises for Section 7.5 

7.37. Show that Q is not locally compact in the subspace topology it inherits from R 
in the standard topology. 

7.38. Prove Theorem 7.40: Let X be a Hausdorff space, and let Y = X U {oo} be its 
one-point compactification. Then the subspace topology that X inherits from 
Y is equal to the original topology on X. 

7.39. Show that the one-point compactification of (0,1) is homeomorphic to the 
circle. 

7.40. Show that the one-point compactification of Q is not Hausdorff. 

7.41. (a) Describe and illustrate the result of taking the one-point compactification 

of the open annulus S 1 x (0,1). 

(b) An open Mobius band is the space obtained from [0,1] x (0,1) by gluing 
the ends as we do with the usual Mobius band. Describe and illustrate the 
result of taking the one-point compactification of the open Mobius band. 
(Hint: The resulting space is one that we have previously encountered.) 

7.42. Let X be Hausdorff and assume Y = X U {oo} is the one-point compactification 
of X. 

(a) Show that if X is not compact, then Cl(k') = Y. 

(b) Show that if X is compact, then Cl(AT) = X, and Y is disconnected with 
{oo} being one of its components. (This shows that not much interesting 
happens when taking the one-point compactification of a space that is 
already compact.) 



Chapter 8 


Dynamical Systems 
and Chaos 


The area of mathematics known as dynamical systems has seen tremendous 
interest and growth over the past thirty years. A dynamical system is a particular 
type of function used to model time-varying processes. Examples of such 
processes appear in fluid mechanics, population growth, celestial mechanics, 
cardiac behavior, particle dynamics, and a multitude of situations where a 
physical system changes over time. In the analysis of a specific dynamical 
system, qualitative tools and techniques are often employed, and concepts 
from topology often underlie these qualitative methods. 

Considered one of the leading figures in the development of topology in 
the late 1800s and early 1900s, Henri Poincare (1854-1912) is also generally 
regarded as the principal founder of the field of dynamical systems. Poincare 
studied the three-body problem, modeling the positions and velocities of three 
bodies in motion under each other’s gravitational influence. Because general- 
solution formulas for the associated differential equations are difficult to obtain, 
he took the novel approach of qualitatively studying their structure within the 
space in which they are defined. Thus the field of dynamical systems was bom. 

In this chapter, we consider dynamical systems defined by repeated appli¬ 
cation of a given function that maps a space to itself. In the first two sections 
we examine basic properties of such systems. In Section 8.3, we provide a 
topological definition of a chaotic dynamical system and explore the relatively 
new idea that a system can be completely deterministic, with all of its behavior 
specified by initial conditions and simple rules for its evolution, and yet still 
exhibit the unpredictable behavior known as chaos. In Section 8.4, we present 
a simple population model that helped set off the chaos revolution. Finally, in 
Section 8.5, we prove that sensitive dependence on initial conditions (popu¬ 
larly referred to as “the butterfly effect”) is a consequence of the topological 
definition of chaos. 

8.1 Iterating Functions 

We focus here on dynamical systems defined by repeated application of a 
function that maps a space to itself. Specifically, let X be a topological space 
and / ; X -» X be a function mapping X to itself. For every n € Z+, define 
f n (x) = f o f o ... o fix), the composition of n copies of the function /. 
The idea is that we start with x, then apply f to x, then apply / to f(x), and 
continue this iterative process until we obtain f n {x). 

DEFINITION 8.1. The dynamical system defined by f : X -» X is the family 
of functions {f n } n eZ+> with each f" mapping X to X. 


256 
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EXAMPLE 8.1. Let / : R -> R be given by /(x) = x/3. Then the dynam¬ 
ical system defined by / is the family of functions given by f n (x) = x/3”. 

_i 


In an application of a dynamical system defined by iterating a function 
/ : X X, we think of /(x) as describing the new state of the system one 
unit of time after it was at state x. For example, if we are modeling a bacteria 
population growing by the hour, we might have a function /(x) representing 
the population size that results one hour after the population was x. Or, if we 
are modeling the position and velocity of a rocket, we might have a function 
f(x,v) representing the position and velocity of the rocket one second after it 
had position and velocity (x, v). 

Let us take a look at a few more examples of functions defining a dynamical 
system. 


EXAMPLE 8.2. Consider the following four functions defined on R: 

(i) / : R —» R, defined by /(x) = —2x, 

(ii) g : R R, defined by g(x) = \x, 

(iii) h : R -* R, defined by h(x) = — x, 

(iv) k : R -» R, defined by k(x) = 0. 

When each of these functions is evaluated at x = 0, the result is 0. We 
say that 0 is a fixed point for the associated dynamical system. (We will 
subsequently define “fixed point” and other terms used in this example.) 

First, consider the function /. If we take a particular point xo, then 
/"(xo) = (—2)"xo. Therefore if xo 7 ^ 0, then repeated iteration of / on 
xo results in values that move further and further from 0, bouncing back and 
forth between positive and negative values. The dynamics of / on R are 
qualitatively depicted in Figure 8.1 in what is called a phase diagram for the 
dynamical system. 





k 


FIGURE 8.1: Phase diagrams for /, g, h, and k. 
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Next consider g. Here iteration of g on a nonzero value results in values 
that move progressively closer to 0, approaching 0 in the limit. In this case, 0 
is referred to as an asymptotically stable fixed point. 

With h we see a different dynamic picture. We have the fixed point at 0, but 
with any other xo the result of iterating h is an oscillation between the values 
—xo and xo- Each nonzero value xo is called a period-2 point of the dynamical 
system. 

Finally, consider k. Here the dynamics are simple. Every point, upon 
application of k, is sent immediately to the fixed point at 0. So 0 is a fixed 
point and every other point is referred to as an eventual fixed point. 


DEFINITION 8.2. Let f : X -* X, and assume x € X. 

(i) The orbit of x under f is the sequence 

(X,f(x), / 2 (x),... ,/"(x),...) 
and is denoted O (x). 

(ii) We say that x is a fixed point of f if /(x) = x. So the orbit of a 
fixed point x is a constant sequence at the point x. 

(iii) We say that x is an eventualfixed point of f if x is not a fixed point 
of f but /"(x) is a fixed point for some n 6 Z + . 

(iv) Assume m e Z + . We say that x is a periodic point or a period-m 
point if f m (x) = x and / ; (x) ^ xfor j = 1,... , m — 1. Under 
these circumstances the orbit of x is called a periodic orbit or a 
period-m orbit. Also, we say that m is the period of the periodic 
point or the periodic orbit. 

(v) We say that x is an eventual periodic point if x is not a periodic 
point but /”(x) is a periodic point for some n e Z+. 


If x is a period-m point for a function / : X —> X, then the points 
x, /(x),... , f m - i(x) are all period-m points and are all distinct. (See Exer¬ 
cise 8.3.) Thus, iterating / on x, we continually cycle through these m periodic 
points. 

A fixed point is a period-1 point, so we include the fixed points within the 
set of periodic points of a function. 


EXAMPLE 8.3. Here we consider two simple examples of savings accounts. 
First, suppose that we deposit money in a savings account that earns 5% in¬ 
terest, compounded annually. After the initial deposit, we do not make any 
further deposits to the account nor do we make any withdrawals from it. We 
simply let the amount in the account accrue the earned interest. The function 
/ : [0, oo) -* [0, oo), given by /(x) = 1.05x, defines a dynamical system 
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that models the amount in the account as it changes year by year. The dynam¬ 
ics of / are straightforward: there is a fixed point at 0, and every other point 
has an orbit that increases away from 0 upon successive iteration of /. (See 
Figure 8.2.) 


0 



FIGURE 8.2: The dynamics of /. 


Now assume that after the interest is applied each year, we withdraw either 
$1,000 from the account (if there is at least that much in the account) or the 
balance of the account (if it is less than $1,000). In this case the function 
g : [0, oo) ->• [0, oo), given by 


g (-*0 = 


1.05* - 1000 
0 


if 1.05* > 1000, 
if 1.05* < 1000, 


defines a dynamical system modeling how the amount in the account changes. 
Here too, 0 is a fixed point. There is another fixed point at x = 20,000 that 
we find by solving g(x) — x. We can also find the fixed point at 20,000 by 
reasoning that the amount in the account will be fixed when it is such that the 
interest of 5% provides exactly the $1,000 needed for the annual withdrawal. 
Since 5% of 20,000 is 1,000, it follows that the fixed point occurs at x — 
20,000. For values of x greater than 20,000, the interest on x provides more 
than the amount needed for the $ 1,000 withdrawal, so the amount in the account 
will grow without bound under successive iteration of g. If x < 20,000, then 
eventually the amount in the account will equal 0. So nonzero values of x that 
are less than 20,000 are eventual fixed points of g. We illustrate the dynamics 
of g in Figure 8.3. 


20,000 


FIGURE 8.3: The dynamics of g. 


As we saw in Example 8.3, if we have a dynamical system defined by iterat¬ 
ing a function /, then we can find fixed points by solving the equation f(x) = 
x. Similarly, to find a period-m point, we solve the equation f m (x) = x and 
take the solutions that are not solutions of fi(x) = * for j = 1 ,... , m — 1. 


EXAMPLE 8.4. Imagine a batch of raisin-bread dough lying across the in¬ 
terval [0, 1], Suppose that we uniformly stretch the dough to twice its length 
and then fold the dough over, pressing it together so that it again lies across the 
interval [0, 1]. (See Figure 8.4.) 
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FIGURE 8.4: Stretching, folding, and pressing the raisin-bread dough. 

LetT : [0, 1] —> [0, 1] be defined by setting T (x) equal to the new position 
of a raisin that was originally at x, after the stretching, folding, and pressing 
of the dough. More precisely, T is defined by 

2 x if * e [ 0 , 5 ], 

2 — 2x if x 6 [j, 1 ]. 

The graph of T is shown in Figure 8.5. For obvious reasons, T is referred 
to as the tent function. We also show the graph of the line y = x in Figure 8.5. 
Fixed points of T occur where T(x) = x and therefore at values of x where the 
graph of T intersects the line y = x. We can see that T has two fixed points; 
they are located at 0 and 2/3. 



FIGURE 8.5: The tent function. 


It is straightforward to see that 1 and 1/2 are eventual fixed points of T, but 
these are not the only ones—there are many, many more, and we show how to 
identify all of them in Section 8.3. 
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Now consider T 2 . The graph is shown in Figure 8.6, along with the graph 
of y = x. We see that there are four solutions to T 2 (x) = x\ two of them are 
the fixed points 0 and 2/3, and the other two are at 2/5 and 4/5. The latter two 
are period-2 points and together they form a periodic orbit. 

Here we have only touched on the very complicated dynamics of the tent 
function. We explore T further in Section 8.3, where we show that T is chaotic. 



FIGURE 8.6: The function T 2 . 


The following definition establishes a notion of equivalence for dynamical 
systems defined by function iteration: 

DEFINITION 8.3. The functions f : X -*■ X and g : Y -»• Y (and the 
dynamical systems defined by them) are said to be topologically conjugate if 
there exists a homeomorphism h : X —» Y such that g o h = h o f. The 
function h is called a topological conjugacy between f and g. 

We illustrate the topological conjugacy condition g o h = h o f in Fig¬ 
ure 8.7. The idea is that both routes from the upper-left X to the lower-right 
Y —across the top, then down the right side, and down the left side, then across 
the bottom—give the same result. We say that the diagram commutes. Essen¬ 
tially, h is mapping the function / to the function g. 


X 


f 




t 

Y 


8 


r 

►r 


FIGURE 8.7: Topological conjugacy requires that g o h = h o /. 


EXAMPLE 8.5. The dynamics of the functions f(x) = 2x and g(x) = 3x 
appear qualitatively the same. In both cases there is a fixed point at 0, and all 
other orbits stay either on the positive or negative side of 0 and move outward 
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from 0. In fact, these two functions are topologically conjugate. The function 
h : R -> R, defined by h{x) = x' og2(3) , is a homeomorphism that satisfies 
goh = ho f. 


A topological conjugacy between two functions / and g naturally maps 
orbits of / to orbits of g, as the following theorem indicates: 

THEOREM 8.4. Let h be a topological conjugacy between f : X -*■ X and 
g : Y —> Y. For each x € X and n e Z+, we have h{f n {x)) = g n (h{x)), and 
consequently h maps the orbit ofx under f to the orbit ofh(x) under g. 

Proof. We prove this by induction on n. The n = 1 case holds by the 
definition of topological conjugacy. Assume that the result holds for 
n — 1. Then, 


h(f n (x)) = M/" -1 (/(*))) 

= g n ~\h(f(x ))) 

= g n ~ l (g(h(x ))) 

= g n (h(x)), 

where the second equality holds by the inductive hypothesis. Thus if 
the result holds for n — 1, then it holds for n. Therefore, by induction, 
b(f n (x )) = g n (h(x)) for all n e Z+. ■ 

The following corollary is an immediate consequence of Theorem 8.4: 

COROLLARY 8.5. Let h be a topological conjugacy between f : X —>■ X 
and g : Y -» Y, and assume that x € X. Then the following implications 
hold: 


(i) Ifx is a fixed point of f, then h(x) is a fixed point of g. 

(ii) Ifx is a period-m point of f, then h(x) is a period-m point of g. 

(iii) Ifx is an eventual fixed point of f, then h(x) is an eventual fixed 
point of g. 

(iv) If x is an eventual periodic point of f, then h(x) is an eventual 
periodic point of g. 


Proof. See Exercise 8.9. 


The corollary implies that important dynamic features of / are mirrored 
in functions that are topologically conjugate to /. We will encounter similar 
results throughout the chapter, and thereby see that topologically conjugate 
functions have equivalent dynamic behavior under iteration. 
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Exercises for Section 8.1 


8.1. For each of the following functions / : R -> R, find all fixed points and 
periodic points and sketch a phase diagram for the dynamical system: 


(a) f{x) = x 3 

(b) fix) = -* 3 

(c) fix) = -jc'/3 

(d) fix) = x — x 2 

(e) fix) = | tan -1 

(f) /(*) = 1-x 2 


(g) fix) = 5 sin(x) 

(h) fix) = | sin(x) 

(i) fix) = e x 

Cj) fix) = 2 ix - x 2 ) 
(k) fix ) = x + sin(x) 


8.2. (a) Consider the linear systems L a : K -*■ K defined by L a (*) = ax where 

a € K. Through a collection of phase diagrams, classify the different 
dynamic behaviors seen in L a , as a ranges over the real numbers. 

(b) Show that if L a and L* have the same dynamic behavior (as identified in 
part (a)), then L a and Lb are topologically conjugate. 

8.3. Show that if x is a period-m point for a function / : X -»• X, then the points 
x, fix),... , f m ~ l ix) are all distinct and are all period-m points. 

8.4. Find as many eventual fixed points as you can for the tent function T. (In 
Section 8.3 we show how to identify all of them.) 

8.5. Find a period-3 point for the tent function T. 

8 .6. Consider the second savings-account model in Example 8.3: 


g(x ) = 


1.05* - 1000 
0 


if 1.05* > 1000, 
if 1.05* < 1000. 


(a) Determine g _1 (0). That is, determine the interval of amounts that result 
in a balance of 0 the following year. 

(b) Determine the interval of amounts that result in a nonzero balance after 
n — 1 years but a balance of 0 after n years. 

8.7. In this exercise we ask you to explore, by explicit computation, orbits associated 
with the functions f a : [0,1] -*■ [0, 1], defined by /„(*) = a*( 1 — *). We 
examine this family of functions further in Section 8.4. 

(a) For /o.6 : [0,1] —> [0,1], fo.6ix) = 0.6* (1 — *), explore and compare 
the orbits determined by the initial values 0, 0.1, and 0.7. 

(b) For /i,6 : [0,1] -*■ [0,1], f\.(,ix) = 1.6x(l — *), explore and compare 
the orbits determined by the initial values 0, 0.1, and 0.7. 

(c) For / 3.2 : [0,1] —>■ [0, 1], / 3 . 2 (*) = 3.2*(1 — *), explore and compare 
the orbits determined by the initial values 0.25 and 0.75. 

(d) For /4 : [0, 1] -»■ [0,1], / 4 (x) = 4*(1 — *), explore and compare the 
orbits determined by the initial values 0.261 and 0.262. 

8.8. Find the fixed points of g a : R -> R, g a (*) = a(* 4- * 2 ), for a > 0. On an 
* versus a coordinate system, plot the fixed points, demonstrating how their 
location changes as the parameter a changes. 

8.9. Prove Corollary 8.5: Let h be a topological conjugacy between / : X -*■ X 
and g : Y -> Y, and assume that * € X. 

(a) If * is a fixed point of /, then /(*) is a fixed point of g. 
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(b) If x is a period-m point of /, then /( x) is a period-m point of g. 

(c) If x is an eventual fixed point of /, then f(x) is an eventual fixed point 
of g. 

(d) If x is an eventual periodic point of /, then f(x) is an eventual periodic 
point of g. 

8.10. Define the function g : [0, 1] [0,1] by 


g(x) = 


3x 

2 — 3x 
3x — 2 


forO < x < j, 
for j < x < |, 
for | < x < 1. 


Show that g is not topologically conjugate to the tent function. 

8.11. Let / : X — * X be continuous. Show that if x € X and y = lim„_>.oo /"(*)> 
then y is a fixed point of /. That is, if the orbit of a point converges, then it 
converges to a fixed point. (Hint: Use Theorem 4.7.) 

8.12. Let / : R —> E be continuous, and assume that I and J are disjoint closed 
and bounded intervals such that /(/) C J and f(J) c /. Prove that there is a 
period-2 point of f ini. 

8.13. (a) Let / : R -> R be a homeomorphism. Prove that / has no periodic points 

with period greater than 2. 

(b) Show that for every n eZ+ there exists a homeomorphism / : R 2 -» 1R 2 
with a period-n point. 

8.14. Assume that f : X X is injective. 

(a) Prove that f n :X->X is injective for each n e Z + . 

(b) Prove that / has no eventual periodic points. 

8.15. Consider/(x) = y^. Let X be the maximal subspace of R on which / defines 
a dynamical system. Show that X has three components, and prove that each 
x e X is a period-3 point with an orbit having one point in each component 
of X- 


8.2 Stability 

In many physical settings and mathematical models, we observe behavior that 
converges toward fixed or periodic states. For example, a cake removed from 
an oven has a temperature that cools toward the fixed room temperature, or a 
child on a swing attains a steady oscillation as he or she pumps the swing to 
balance off the effect of damping. In mathematics, we see, for instance, that 
for f(x) = jx every point has an orbit that converges to the fixed point at 0, 
and for g(x) = — x 1 / 3 every point except 0 has an orbit that converges to the 
periodic orbit {—1, 1}. 

In this section we capture these behaviors through what is known as the 
asymptotic stability of a fixed point or periodic point of a dynamical system. 
We begin with the definitions associated with stability. 
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DEFINITION 8.6. Given f : X -» X, assume that x* is a fixed point of f. 

(i) We say that x* is stable if for every open set U containing x* there 
is an open set V containing x* such that for every x e V, the orbit 
of x lies in U. 

(ii) We say that x* is asymptotically stable if x* is stable and if there 
exists an open set U containing x* such that 


lim f n (x)=x* 


for every x e U. 

(iHi) We say that x* is neutrally stable ifx* is stable but not asymptoti¬ 
cally stable. 

(iv) We say that x* is unstable ifx * is not stable. 

DEFINITION 8.7. Given f : X ->■ X, assume that x* is a period-m point of 
f. We say that x* is a stable periodic point, or has a stable periodic orbit, if 
x* is stable as a fixed point of f m . We similarly define asymptotically stable, 
neutrally stable, and unstable for periodic points and periodic orbits. 

Implicit in the definition of stable periodic points and periodic orbits is the 
fact that if x* is a stable period-m point of f, then so are 

/(**),/V). ,/ M ~V). 

The same holds for the definitions of asymptotically stable, neutrally stable, 
and unstable. These facts are not automatic, but must be shown to be true for 
the definitions to make sense. (See Exercise 8.17.) 


EXAMPLE 8.6. We consider again the four functions from Example 8.2: 

(i) / : M -> M, defined by f(x) = —2x, 

(ii) g : E -> M, defined by g(x) = jx, 

(iii) h : R —>■ R, defined by h(x) — —x, 

(iv) k : R R, defined by k(x) = 0. 

There is a fixed point at 0 for each of these functions. For /, note that if we 
take the open interval U = (—1,1) containing 0, then, except for the orbit at 
0, every orbit that begins in U eventually leaves U. Therefore 0 is an unstable 
fixed point of /. Next consider the fixed point at 0 for g. Given any open 
set U containing 0, let V be an open interval (— e, e ) contained in U. Every 
orbit beginning in V stays in V and approaches 0 in the limit. Therefore 0 is 
an asymptotically stable fixed point of g. For k, the fixed point at 0 is also 
asymptotically stable. For h, the fixed point at 0 is neutrally stable and every 
point other than 0 is a neutrally stable period-2 point. 





266 Chapter 8 Dynamical Systems and Chaos 


EXAMPLE 8.7. Consider again the savings account models in Example 8.3. 
For the case where the account accumulates interest, and there are no deposits 
or withdrawals, we have the dynamical system defined by /(*) = 1.05*. For 
this /, we have an unstable fixed point at 0. If we start with any nonzero 
amount of money, no matter how small, the amount in the account grows away 
from 0 . 

In the second savings account model, where we have an annual $1,000 
withdrawal after the application of interest, the dynamical system is defined by 


g(x ) = 


1.05* - 1000 
0 


if 1.05* > 1000, 
if 1.05* < 1000. 


In this case 0 is an asymptotically stable fixed point. If we start the account 
with amounts close to 0 , then we end up at 0 in the long run. (in fact, for small 
enough initial amounts, we go to 0 immediately.) On the other hand, we have 
an unstable fixed point at 20,000. Initial amounts that are greater than 20,000 
increase without bound under this dynamical system, and amounts less than 
20,000 decrease to 0 . 


As already indicated, stable fixed points and periodic points are what we 
often observe or approach in the long run in both the mathematics of a dy¬ 
namical system and the behavior of a physical system that a dynamical system 
models. We see this demonstrated in Examples 8.8 and 8.10. 


EXAMPLE 8.8. Let / : [0, 1] -► [0, 1] be defined by /(*) = 2*(1 - *). If 
we take a value w € ( 0 , 1 ) and use a calculator to compute f(w) and successive 
iterations of / on w, we see that eventually the calculator returns 0.5 every 
time. Through this process we have discovered what appears to be a stable 
fixed point of /. In fact, 0.5 is an asymptotically stable fixed point, and this 
system drives all values in (0, 1) toward it. Therefore being near 0.5, or fixed 
at 0.5, is expected long-term behavior in this system. 

Of course, we could find the fixed point 0.5 for/by solving 2*(1 —*) = *. 
In doing so, we obtain * = 0 or 0.5. So there is also a fixed point at 0. The 
fixed point at 0 is unstable, and therefore the process of successive iteration of 
/ does not reveal this fixed point unless we choose it (or 1) at the start. Being 
fixed at 0 is not expected long-term behavior in this system. 

Even though the calculator brings us to the value 0.5 after finitely many 
iterations of / on w € (0, 1), the point w is actually not an eventual fixed point 
of /. The reason that we eventually obtain exactly 0.5 is that the calculator 
rounds off its results along the way, and at some point it rounds off to exactly 0.5, 
where it is then fixed. So if w € (0,1) and w ^ 0.5, then lim,,-^ f n (w ) = 
0.5, but f n (w ) ± 0.5 for each n. 


A convenient method for viewing the behavior of a dynamical system 
defined on a domain in R is what is known as a web diagram. (See Figure 8 . 8 .) 
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We illustrate this method with the function / : [0, 1] —> [0, 1] from the 
previous example. To begin, we draw the graphs of f(x) = 2x(l — x) and 
y — x. Then we take an initial value a e [0,1] and start with the point (a, a) 
on the line y = x. We go vertically to the point (a, f(a)) on the graph of f(x) 
and then horizontally to the point ( f(a ), f (a)) on the line y — x. Repeating 
these steps brings us to the point ( f 2 (a ), f 2 {a )) on the line y = x. By further 
repeating this process, we obtain the orbit of a, depicted on the line y = x. 



FIGURE 8.8: A web diagram for f(x) - 2x(l - x). 

The following theorem classifies all of the possible fixed-point and stability 
properties of the linear functions f(x) = mx + b: 

THEOREM 8.8. Let f : R —»• R be the linear function f(x) = mx -I- b. 

(i) Ifmf= 1 , then f has a unique fixed point, and that fixed point is 
neutrally stable ifm = — 1, asymptotically stable if\m\ < 1, and 
unstable if \m\ > 1. 

(ii) Ifm = 1, then f has no fixed points ifb 0, and every x € I is 
a neutrally stable fixed point ifb = 0. 

Proof. See Exercise 8.19. ■ 


In Figure 8.9 we use web diagrams to illustrate some of the stability pos¬ 
sibilities asserted in Theorem 8.8. 

Now, if a function is differentiable at a fixed point, then the following 
theorem indicates that the stability of the fixed point can be determined by the 
value of the derivative at the fixed point, as long as that value is not — 1 or 1. 

THEOREM 8.9. Let X be a subset o/M. Assume that xo is a fixed point of 
f : X -*■ X and that f is differentiable at xo- Then x$ is asymptotically stable 
tf\f'(xQ)\ < 1 and xq is unstable tf\f'{xf)\ > 1. 

We do not prove Theorem 8.9. Proofs can be found in standard intro¬ 
ductory texts on dynamical systems, such as [DevR]. The idea is that as 
long as |/'(xo)| ^ 1, the stability of xq as a fixed point of / is deter- 
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FIGURE 8.9: Stability properties of linear functions /(*) = mx + b. 

mined by the stability of jco as a fixed point of the linear approximation to 
/ based at *o- (See Figure 8.10.) The linear approximation is the function 
L{x) = f'(x o)(* - x Q ) + f(x o). 

In the transition cases that are not addressed by Theorem 8.9, where 
|/'(xo)| = 1, it is possible that *o could be neutrally stable, asymptotically 
stable, or unstable. (See Exercise 8.20.) 



FIGURE 8.10: The stability of the fixed point *o is the same for both / and L. 


EXAMPLE 8.9. Let us revisit some of the examples we have previously 
considered to see how Theorems 8.8 and 8.9 verify our observations regarding 
stability. 
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For the savings-account model without withdrawal in Example 8.3, given 
by f(x) = 1.05.x, we have a fixed point at 0. The slope of / at 0 is 1.05, which 
is greater than 1, and therefore the fixed point is unstable. 

For the savings-account model including withdrawal in Example 8.3, given 
by 


g(x) = 


1.05.x - 1000 
0 


if 1.05x > 1000, 
if 1.05x < 1000, 


we have fixed points at 0 and 20,000. The slope of g at 20,000 is 1.05, implying 
that 20,000 is an unstable fixed point. The slope of g at 0 is 0, implying that 0 
is an asymptotically stable fixed point. 

For the function /(x) — 2x(l — x) in Example 8.8, there are fixed points 
at 0 and 1/2. Furthermore, /'(0) = 2, so 0 is an unstable fixed point, and 
/'(1/2) = 0, so 1/2 is a stable fixed point. 

For the tent function T in Example 8.4 we have fixed points at 0 and 2/3. 
The slope of T at the former is 2 and at the latter is —2. It follows that both 
are unstable fixed points. There are also period-2 points at 2/5 and 4/5. We 
can see from the graph of T 2 in Figure 8.6 that the slope of T 2 at each is —4. 
Therefore these periodic points are unstable. Thus, in the tent function we do 
not expect to see the system settle down in the long run toward either of the 
fixed points at 0 and 2/3, nor do we expect it to settle down toward the periodic 
orbit {2/5,4/5}. In fact, as we will see in Section 8.3, there are periodic points 
of T densely distributed over [0, 1], but they are all unstable, so the system 
never settles down to them. 


It is not a coincidence that the derivative of T 2 is the same at each of the 
points in the periodic orbit {2/5, 4/5}. In fact, given / : X ->• X where 
X c R, if xi,... , x m form a period-m orbit of /, and if / is differen¬ 
tiable, then the derivative of f m at each of the points x\ ,... , x m is equal 
to /'(xi)/'(x 2 )... f'(x m ), the product of the values obtained by taking the 
derivative of / at each point in the orbit. (See Exercise 8.21.) 


EXAMPLE 8.10. In this example, we consider a mass that hangs on a spring 
attached to a ceiling. We assume that there is an equilibrium position at which 
the mass rests when there is no motion. The equilibrium position serves as 
the origin for determining the displacement of the mass. Further, we assume 
that when the mass is in motion, it oscillates up and down past the equilibrium 
position and rises no higher than a distance M from the equilibrium position. 
We model the motion of the mass by a function / : [0, M ] —»• [0, M], which 
is defined so that /(x) is the maximum height of the mass on a particular 
oscillation if the maximum height of the mass was x on the previous oscillation. 
(See Figure 8.11.) 

In damped motion, for example that caused by air resistance, we would 
have /(0) = 0 and /(x) < x for all otherx, as shown on the left in Figure 8.12. 
Thus we would have a stable fixed point at 0, and in the long run we would 
expect to see the spring-mass system settle toward equilibrium. 
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FIGURE 8.11: The spring and mass in oscillation. 

On the other hand, assume that we are exciting the motion, for example 
by giving the spring a downward push every time it passes the equilibrium 
position in the downward direction. In this case we would have /(0) = 0 and 
f(x) > x for all other x, as shown on the right in Figure 8.12. Here we would 
have an unstable fixed point at 0. Once this system was in motion it would stay 
in motion with successively greater and greater oscillations. 




FIGURE 8.12: The graphs for damped motion and excited motion. 

Let us see what happens when we combine these two scenarios, having 
small-scale excitation and large-scale damping. Assume we provide a push 
on the mass when the maximum points of the oscillations are near 0, and we 
let damping take hold when the maximum points are large. We assume our 
function / is such that /(0) = 0, f{x) > x for x near 0, f(x) < x for 
large x, and / is continuous as it transitions between the small-scale excitation 
and large-scale damping. (See Figure 8.13.) The Intermediate Value Theorem 
guarantees that we have a point xq > 0 with /(xo) = xo; that is, we have 
a fixed point. Note that the fixed point does not represent a situation where 
the mass is fixed, but instead represents a maximum height above equilibrium 
to which the mass returns at each oscillation. We regard this as a sustained 
oscillation—one that neither dampens nor grows. Thus, small-scale excitation 
with large-scale damping yields a sustained oscillation. 
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If, furthermore, the graph of / is as shown in Figure 8.13, then 0 < 
fix o) < 1 and ,\'o is stable. Therefore the sustained oscillation is a behavior 
toward which we would expect this system to settle in the long run. 



FIGURE 8.13: Combining small-scale excitation and large-scale damping. 


Sustained oscillations, as seen in Example 8.10, are common in many 
physical systems. As you blow on the reed of a woodwind instrument, you are 
producing a small-scale excitation that is opposed by the restoring force tending 
to keep the reed flat. The oscillation that results from the balance between the 
excitation and the restoring force produces the sound that becomes music in the 
instrument. The van der Pol oscillator is a classic differential-equation model 
for a sustained oscillation. It was first introduced by Balthazar van der Pol 
(1889-1959) as a model for cardiac oscillations, but he also examined it more 
generally as a model for electronic circuits. The differential equation reflects 
both a small-scale excitation and a large-scale damping, and consequently its 
solutions tend toward a sustained oscillation. (See [Str].) 

Exercises for Section 8.2 

8.16. For each of the following functions determine the fixed and periodic points and 
for each such point use Theorem 8.9 to determine its stability: 

(a) fix) = x 3 (f) fix) = \ sin(x) 

(b) fix) = -x 3 (g) fix) = | sin(x) 

(c) fix) = -x>/ 3 (h) fix) = 2(x -x 2 ) 

(d) fix) = £tan _1 (x) (i) fix) = x + sin(x) 

(e) fix) = 1 - x 2 


8.17. (a) Prove that if x* is a stable period-m point of a function /, then so are 
fix*), fix*),... ,f m ~\x*). 

(b) Prove that if x* is an asymptotically stable period-m point of a function 
/, then so are fix*), fix*),... , / m_1 (x*). 

(c) Prove that if x* is a neutrally stable period-m point of a function /, then 
so are fix*), fix*),... , /' n_1 (x*). 

(d) Prove that if x* is an unstable period-m point of a function /, then so are 
fix*), fix*),... ,f m -\x*). 
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8.18. Draw web diagrams for the dynamical systems in Examples 8.3, 8.4, and 8.8, 
illustrating the stability properties discussed in Example 8.9. 

8.19. Prove Theorem 8.8: Let / : K -»■ R be the linear function fix) = mx +b. 

(a) If m 1, then / has a unique fixed point, and that fixed point is neutrally 
stable if m — — 1, asymptotically stable if \m\ < 1, and unstable if \m\ > 1. 

(b) If m — 1, then / has no fixed points if b # 0, and every x e R is a 
neutrally stable fixed point if b = 0. 

8.20. In the transition cases that are not addressed by Theorem 8.9, where |/'(xo)| = 
1 at a fixed point xo of /, it is possible that jco could be neutrally stable, 
asymptotically stable, or unstable. 

(a) Find an example of a function / with a fixed point xo where |/'(xo)| = 1 
and xo is neutrally stable. 

(b) Find an example of a function / with a fixed point xo where |/'(xo)| = 1 
and xo is asymptotically stable. 

(c) Find an example of a function / with a fixed point xo where |/'(xo)| = 1 
and xo is unstable. 

8.21. Assume I C 1 and / : X —>• X is differentiable. Show that if x\ ,... , x m 
constitute a period-m orbit of /, then for each i, 

(f m y(xi) = f'(x l )f , ( X2 )...f'(x m ). 

8.22. Newton’s method is an iterative process for approximating the zeros of a func¬ 
tion / : D -* K with D C R. We assume that / is twice differentiable. For 
such an / define g(x) = x — jr^-y The idea behind Newton’s method is to 
make an initial guess xo of a zero of /, and then iterate g on xo to obtain a 
sequence of points, x„ = g"(xo), that (we hope) converges to a zero of /. 

(a) Show that if x is a fixed point of g then x is a zero of /. 

(b) Show that every fixed point of g is asymptotically stable. 

If x is such that /'(x) = 0 then g is not defined at x. Such an x could be a 
zero of /. If z is a zero of / for which /'(z) 0, then the results of parts (a) 

and (b) imply that if we make an initial guess xo sufficiently close to z, then the 
orbit of xo under g converges to z. 

Now, consider the situation where we have two fixed points zi and zi of g with 
no fixed point between them. By (b) there are neighborhoods U\ and U 2 of 
zi and Z 2 , respectively, such that the orbit of each point in t/,- converges to z/. 
Something must happen between zi and Z 2 to separate the points whose orbits 
converge to zi from those whose orbits converge to Z 2 - At the very least, we 
can do the following: 

(c) Prove that between every pair of fixed points of g there is a point at which 
g is not defined. 


8.3 Chaos 

In 1961, Edward Lorenz, a meteorologist at the Massachusetts Institute of 
Technology, was attempting to simulate weather patterns on a computer using 
a model derived from twelve relatively complicated equations. Rather than 
repeat the entirety of a computer run from the previous day, he decided to begin 
the calculation anew using data output from partway through the previous run. 
However, instead of inputting the data to the six-decimal-place accuracy that 
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was retained by the computer, he utilized the three-decimal-place accuracy 
that the computer printout provided. Much to his surprise, the results of the 
run were entirely different from what they had been before. He was seeing 
sensitivity to initial conditions. Even though the equations were fixed, just a 
small change in the input conditions caused a major change in the outcome. 
This is often called the “butterfly effect,” as a butterfly fluttering its wings in 
Hong Kong might change conditions enough to eventually cause a tornado in 
Texas. In other words, small changes can have dramatic impact. This is why 
long-term weather prediction is so difficult. In “Deterministic nonperiodic 
flow,” his now-classic paper about this discovery, Lorenz wrote, 

When our results... are applied to the atmosphere... they indicate 
that prediction of the sufficiently distant future is impossible by 
any method, unless the present conditions are known exactly. In 
view of the inevitable inaccuracy and incompleteness of weather 
observations, precise very-long-range forecasting would seem to 
be non-existent. 

With the aid of a computer, Lorenz was observing a phenomenon that Henri 
Poincare had intuitively described sixty years earlier, as a result of his study 
of the three-body problem. In a 1903 essay, “Science and Method,” Poincare 
suggested, 

If we knew exactly the laws of nature and the situation of the uni¬ 
verse at the initial moment, we could predict exactly the situation 
of that same universe at a succeeding moment. But... we could 
still only know the initial situation approximately. If that enabled 
us to predict the succeeding situation with the same approximation , 
that is all we require... . But it is not always so; it may happen 
that small differences in the initial conditions produce very great 
ones in the final phenomena. [Pet] 

Between the times of Poincare and Lorenz, others had also glimpsed this 
intriguing phenomenon, but it took the introduction of the computer as an ex¬ 
perimental tool in mathematics and science for it to be seen and understood 
broadly enough to be recognized as a robust part of many physical and math¬ 
ematical systems. 

A mathematical and scientific revolution was bom, and over the subsequent 
decades many scientists and mathematicians worked to identify, describe, and 
define this phenomenon, its properties, and its consequences. Chaos is the 
name that is widely used for the general structure and behaviors that result in 
sensitive dependence on initial conditions. 

Along with the unpredictability of sensitive dependence on initial condi¬ 
tions, chaos also includes an element of regularity. Orbits in a chaotic system 
can appear periodic over a span of time (possibly very long), but eventually 
they diverge into another realm of behavior, perhaps a different apparent peri¬ 
odicity. The regularity of the Earth orbiting the sun might be an approximate 
long-term periodicity in an overall chaotic system. (See [Pet] for a further 
discussion about this possibility.) 
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In this section we present a topological definition of chaos, and we discuss 
some essential features of chaos: it is deterministic, but unpredictable; it pos¬ 
sesses hidden regularity, and at the same time mixes together all regions of the 
domain. We also examine the tent function further and present two approaches 
to showing that it is chaotic. Finally, we introduce sensitive dependence on 
initial conditions and present a theorem showing that every continuous chaotic 
function on an infinite metric space has sensitive dependence on its initial 
conditions. 

DEFINITION 8.10. Let X be a topological space. A function f : X -> X is 
said to be chaotic or to have chaos if 

(i) The set of periodic points of f is dense in X, 

(ii) For every U, V open in X, there exists x e U and n e Z+ such 
that /»W € V 


The first condition indicates that there is regular periodic behavior densely 
distributed throughout the domain. No matter what point we choose in the 
domain, there are periodic points arbitrarily close by. 

The second condition, referred to as topological transitivity, indicates that 
every pair of regions in the domain is mixed together by the system. Given 
any pair of open sets, there is at least one point in the first set that, on some 
iteration, is mapped into the second set. 

Consider again the tent function 


T(x) = 



if* € [ 0 , 1 ], 
if * e [$,!]. 


In Section 8.1 we presented the graphs of T and T 2 . In Figure 8.14 we 
show the graphs of 7 ’ 3 and T 4 . The pattern is apparent. The graph of T n 
results in a “tent” over [^ry, for each j = 1,2,... , 2" _1 . It follows 
that each such interval contains two intersections of the graph of T n with the 
line y = x. These intersection points are periodic points of T. Thus, as n 
gets larger and larger, the intervals [4jEr, ^rr] partition [0,1 ] into smaller and 
smaller intervals, each of which contains periodic points. 



FIGURE 8 14: The graphs of T 3 and T 4 . 
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Furthermore, T n maps each interval ^hr] onto [0, 1]. So as n gets 
larger and larger, smaller and smaller intervals are getting spread out or “mixed” 
by T n over the whole interval [0, 1]. Thus the two main components of chaos 
appear to be present in the tent function. In a set of supplementary exercises at 
the end of this section, we work through the details in this approach to proving 
that T is chaotic. 

However, we will now take a different approach to proving that T is chaotic; 
this approach is based on binary expansions of the real numbers in [ 0 , 1 ], 
Specifically, we use the fact that every x e [0, 1] can be expressed in the form 


2 


a 2 a m 

+ — + + —■ + 
+ 2 2 ‘ ‘' 2 m 


where each a- t equals either 0 or I. For such x, we have the binary expansion 
x — .a\ci 2 ■ ■ ■ a m - 

For a = 0 or 1, we let a* = 1 — a. Note that (a*)* = a. Also, 
if x has binary expansion .a\a 2 .. .a m ... then 1 — x has binary expansion 
.a*a*...a* n .... 

Now, if x e [0, \ ] then x has a binary expansion in the form , 0 a 2 ... a m ..., 
and it follows that 


n, 0 / 0 a 2 . a m \ 

2 x = 2 y-+ J = .a 2 a 3 ...a m ... . 

Furthermore, if x e [ 5 , 1], then x has a binary expansion in the form 

. la 2 ... a m ...; so 1 — x = .0 a| • • • ..., and 2 — 2 x = .a^a^ ... a,* _ 

(Note that binary expansions are not unique. For example, 1/2 can be expressed 
as both . 1000000 ... and . 01111111 ..., and for that reason we include 1/2 in 
each of the possibilities, x e [ 0 , j]orx e [ 5 , 1 ], presented here.) 

Therefore, using binary expansions, we can express the tent function 
T : [0, 1] [0,1] as follows: 


T(.a\ci2 ■ • ■ @m 


■ ) = 


M 2 C 13 ...a m . 
• a 2 a t •■■ a m 


if a\ = 0 , 
if a\ = 1 . 


For example, T (.000...) = .000..., confirming our fixed point at 0. The 
binary expansion for 1 is . 111 ..., and T(.ll 1 ...) = . 000 ..., so 7(1) = 0 . 
For 2/3, the binary expansion is .1010..., and 7(.1010...) = .1010...; thus 
(as we knew already) 2/3 is a fixed point. For 2/5 we have binary expansion 
.01100110..., and 7(.01100110...) = .11001100... = 4/5. On the other 
hand, T (.11001100...) = .011011... = 2/5. Therefore, as we saw earlier, 
we have period-2 points at 2/5 and 4/5. 

With this formulation of T, we also have a simply way to express the value 
of T n {.a\a .2 ... a m . ..), as the following lemma indicates: 
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LEMMA 8.11. For .a\ci 2 . ■ ■ a m • • • e [0, 1] and n e Z+, 


T n (.a\a 2 ...a m ...) = 


I -Qn+l&n+2 • ■ • tf — 0 > 

■ a n+l a n+2--' if a n = F 


Proof We prove this by induction on n. For n = 1, the result follows 
by the definition of T. Assume that the result holds for n — 1. Then 

T n (.a\a 2 ...a m ...) = T (T n ~ x {.a\a 2 ... a m ...)) 

T(.a n a n+ \...) if a n -\ — 0, 

~ \T(.a*a * + !•••) if^n-i = 1 , 

where the second equality holds by the inductive hypothesis. Assume 
that a n = 0; then T n (.a\a 2 ... a m ...) equals either T(.0a n+ ]a n +2 ■ ■ ■) 
or T (. \a* +[ a* (2 ...). In either case, by the definition of T, the result is 
.a n+ \a n+ 2 ..., as desired. 

Now assume that a„ = 1. Then T n (.a\a 2 .. .a m ...) equals either 
T (. \a n+ \a n+ 2 ■ ■.) or T (. 0 a* +] a * +2 ...), and in either case we obtain 
the desired result, .a n+l a n+2 --‘- 

Thus, if the result holds for n — 1 it holds for n, and it follows by 
induction that the result holds for all n € Z + . ■ 


In our proof that T is chaotic, we use the following lemma, which indicates 
that if two points in [ 0 , 1 ] have associated binary expansions that agree in their 
first n entries, then the distance between those two points is at most 


LEMMA 8.12. Let x and y have binary expansions .a\a 2 ■ ■ .a m ... and 
.b\b 2 ... b m ..., respectively. If at = h,- for i = 1,... , n, then \x — y\ < ^. 


Proof We have 

\*-y\ = 


E oc 

i=, 


— 2-,j=n +1 


j=n +1 2 i 

a i~ b i 
2 i 


< sr-oo J_ 

— L-,j=n+\ 2 i 

- - _L X 

2 " Z _,;'=1 2 1 


± 

2 n 


(since aj — bj < 1 ) 


Now we establish that T is chaotic. 


(since Ef=l Y> = 0- 
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THEOREM 8.13. The tent function T is chaotic. 

Proof. We begin by showing that periodic points of T are dense in [0, 1 ]. 
It suffices to show that if x e [0, 1] and s > 0, then there is a periodic 
point p such that \x — p\ < e. Thus let x e [0, 1] and e > 0 be arbitrary. 
Assume .a\aj.. .a m ... is a binary expansion of x and n e Z+ is large 
enough that ^ < e. If we let 

p = .a\a 2 • • ■ a n 0 a\a 2 . •. a n 0 a\a 2 ... a n 0 ... , 

then by using Lemma 8.11 it follows that p is a periodic point, and 
Lemma 8.12 implies that |x — p\ < e. Thus periodic points of T are 
dense in [0, 1]. 

To show topological transitivity, let U and V be open in [0, 1], and 
let x = .a\a 2 ... a m ... € U be arbitrary. Since U is open, there exists 
e > 0 such that (x — e, x + «) fl [0, 1] e U. Let n e Z+ be large enough 
that < e. Pick y = .b\b 2 ... b m ... € V, and consider the point 

p — ,a\a2 . ..a n 0bib2b2 .... 

By Lemma 8.12, |x — p\ <jn< e, and therefore p e U. Lemma 8.11 
implies that T n+l (p) = y e V. It follows that T is topologically 
transitive and therefore is chaotic. ■ 

In the proof of the topological transiiivity of T, the binary expansion 
.b\b 2 ■ ■ - bm ... was entirely arbitrary and could be associated with any point 
in [0, 1]. What we actually proved was that, given U open in [0, 1], there 
exists n e Z + such that T n maps U onto [0, 1]. This is not a surprise, since we 
noted this phenomenon earlier. Specifically, for an open set U in [0, 1] there 
is m 6 Z+ large enough that some interval ^rr] is contained in U. We 
previously discussed that T m maps each such interval y] onto [0, 1J. 

Although it does not play a role in the definition of chaos, we can also see 
that eventual fixed points of T are dense in [0, 1]. It follows from Lemma 8.11 
that every point in [0, 1] having an associated binary expansion that ends in 
all 0s, all Is, or alternating 0s and Is is an eventual fixed point. (In fact, this 
accounts for all of them—see Exercise 8.23.) Given x with binary expansion 
.ai <22 ... a m .... the point with binary expansion ,a\a 2 ... a n 000 ... is within 
2 ^ of x, and therefore arbitrarily close to x there are points that are eventual 
fixed points. 

It is straightforward to see that the periodic points of the tent function are 
all unstable. In fact, the following theorem indicates that all of the periodic 
points are unstable for every chaotic function defined on an infinite Hausdorff 
space. Thus, even though periodic points are densely distributed throughout 
the domain, we do not expect to see any of them in the long-term in such a 
chaotic system. 

THEOREM 8.14. Let X be an infinite Hausdorff space. If f : X —» X is 
chaotic then every periodic point of f is unstable. 


Proof. See Exercise 8.25. 
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In general, a direct proof that a function is chaotic is not nearly as easy to 
formulate as it is for the tent function. However, topological conjugacy can 
help identify chaotic functions. The following theorem shows that, as we might 
expect, chaos is preserved under topological conjugacy. Thus, given a chaotic 
function, every function that is topologically conjugate to it is also chaotic. 


THEOREM 8.15. Iff and g are topologically conjugate functions and f has 
chaos, then g has chaos. 

Proof. See Exercise 8.26. ■ 

The function Q : [0, 1] —*■ [0, 1] defined by Q(x) = 4x(l — x ) is topo¬ 
logically conjugate to the tent function. A topological conjugacy is given by 
the homeomorphism h : [0, 1] -» [0, 1] defined by h(x) = sin 2 (~-x). It is 
straightforward to show that hoT = Q oh. (See Exercise 8.27.) 

Thus Theorems 8.13 and 8.15 imply that Q is chaotic. Note that the graph 
of <2 has features similar to those in the graph of T. (See Figure 8.15.) Both 
functions can be viewed as stretching the interval [0, 1] to twice its length, and 
then folding the stretched interval over so that (except at x = 1/2) it maps 
two-to-one onto [0, 1]. 



FIGURE 8.15: The function Q(x) = 4x(l - x). 

The stretching and folding present in the functions T and Q is typical of 
the nonlinear behavior that causes chaos. The stretching tends to drive orbits 
apart while the folding keeps the dynamics confined to a compact domain. 
This combination of behaviors can result in sensitive dependence on initial 
conditions and the unpredictability present with chaos. Sensitive dependence 
on initial conditions is a metric property, defined as follows: 


DEFINITION 8.16. Let ( X , d) be a metric space. A function f : X -*■ X 
has sensitive dependence on initial conditions if there exists a 8 > 0 such that 
for every x e X and e > 0, there exists y € Bd(x,e ) and n e Z+ such that 
d(f n (x),f"(y))>8. 
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Thus, a function / has sensitive dependence on initial conditions if there 
is a distance S such that arbitrarily close to every point x there are points whose 
image under some iteration of / is more that S away from the image of x under 
the same iteration of /. We refer to S as a sensitivity constant for /. 

In Section 8.5 we will prove that chaos implies sensitive dependence on 
initial conditions. In the introduction to this section, we indicated that chaotic 
behavior is deterministic, but unpredictable, and that it possesses hidden reg¬ 
ularity, while at the same time mixing together all regions of the domain. The 
hidden regularity and mixing are captured in the definition of chaos via the 
dense set of periodic points and topological transitivity, respectively. Being 
deterministic is a feature of all dynamical systems; the idea is that if you 
know the starting point, then the future is completely determined by the func¬ 
tion defining the dynamical system. The unpredictability in a chaotic system 
comes as a result of the sensitive dependence on initial conditions. If there is 
any uncertainty or error in the knowledge of the initial state, then—as Lorenz 
suggested about the atmosphere—precise long-term predictions are not possi¬ 
ble. 


Exercises for Section 8.3 


8.23. Prove that x € [0, 1] is an eventual fixed point of the tent function if and only 
if the binary expansion of x ends in a sequence of all Os, all Is, or alternating 
Os and Is. 


8.24. (a) Show that if X has the trivial topology, then / : X -*■ X is chaotic if and 

only if / has a periodic point. 

(b) Show that if X has the discrete topology and / : X -*■ X is chaotic, then 
X is finite and / is a cyclic permutation of the elements of X. 

8.25. Prove Theorem 8.14: Let X be an infinite Hausdorff space. If / : X — ► X is 
chaotic then every periodic point of / is unstable. 

8.26. Prove Theorem 8.15: If / and g are topologically conjugate functions and / 
has chaos, then g has chaos. 

8.27. Verify that the homeomorphism h : [0,1] —► [0,1], defined by h(x) = 
sin 2 (^;t), is a topological conjugacy between the tent function and the function 
Q : [0, 1] -* [0, 1], defined by Q(x) = 4*(1 - x). 

8.28. Show that the functions Q : [0,1] —> [0,1], defined by Q(x) = 4jc(1 — x), 
and R : [—1,1] —> [-1, 1], defined by R(x) = 1 - 2x 2 , are topologically 
conjugate. (It follows R is chaotic.) 

8.29. Show that the function g : [0,1] —» [0,1], defined by 


g(x) - 


3x 

2 — 3x 
3x — 2 


forO < x < 
for 5 < x < |, 
for | < x < 1 , 


is chaotic, by using ternary expansions to show each of the following: 

(a) Periodic points are dense. 

(b) g is topologically transitive. 
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8.30. Show that if / : X -*■ X has a point x whose orbit is dense in X then / is 
topologically transitive. (The converse of this theorem is also true if X is a 
compact subset of either R or S 1 , but the proof requires tools not introduced in 
this text.) 

8.31. On S 1 c R 2 , let 6 represent the point at angle 6 measured counterclockwise 
from the positive ac-axis. Define / : S l -*■ S 1 by f(6) — 26. In (a) and (b) we 
prove that / is chaotic. 

(a) Prove that for n, j e Z + , the points with angular representation are 
fixed points of /", and thereby show that periodic points of / are dense 
in S l . 

(b) Prove that every interval of the form [fc>, 6 + C S' 1 is mapped onto S 1 
by /", and then use this result to show that / is topologically transitive. 


Supplementary Exercises: Another Approach to Proving 
that the Tent Function is Chaotic 

We now provide a second proof that the tent function T is chaotic. This approach is 
easy to visualize given the nature of the graph of T n . We begin with a lemma asserting 
that the graph of T n is a “tent” over each interval [jj^, ^rp]. 

LEMMA 8.17. For each w € Z + and j = 1,2,, 2"~\ we have 

j~ 1 j ~ 1/2 ] 

2 »-i ’ 2 n_1 J ’ 

j - 1/2 j 1 
2«-i ’ 2"— 1 J 

In particular, over ^rp] the function T n equals 0 at the endpoints, equals 1 at 
the midpoint, and is linear on each half of the interval. 

SE8.32. Prove Lemma 8.17. (Hint: Use induction; the n = 1 case holds by the 
definition of T .) 

With the lemma, it is straightforward to prove the desired theorem: 

THEOREM 8.18. The tent function T is chaotic 

Proof. We begin by showing that periodic points are dense in [0,1], It suffices 
to prove that every open set in [0,1] contains a periodic point. Let U c [0,1] 
be an arbitrary open set. ■ 


T n (x) = 


2 n x — 2j + 2 ifxe 

-2 n x + 2 n +2j — 2 ifxe 


SE 8.33. Prove that there exist n, j € Z+ such that the interval lies in U. 

SE 8.34. With n as in the previous exercise, prove that T n has a fixed point in U and 
therefore that there is a periodic point of T in U. 

Next we establish topological transitivity. Let U and V be arbitrary open sets in [0,1]. 

SE 8.35. Use Exercise SE 8.33 and Lemma 8.17 to show that there exists n e Z+ such 
that V C T n (U), and thereby conclude that T is topologically transitive. 
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8.4 A Simple Population Model with Complicated Dynamics 

The title of this section is an adaptation of the title of a paper, “Simple math¬ 
ematical models with very complicated dynamics,” by physicist and mathe¬ 
matical biologist Robert May, published in the journal Nature in 1976. May’s 
paper was a landmark in the chaos revolution. Along with a presentation of 
interesting mathematical results and their potential applied consequences, this 
paper was a call to the scientific community to realize that simple nonlinear 
equations can possess a wide spectrum of behaviors from fixed point to periodic 
to chaotic, and that unpredictable behavior in a modeled system, previously 
thought to be “noise” arising externally from the environment, could be a result 
of chaotic behavior internal to the system. May further urged that these ideas 
be introduced early in students’ mathematics-education experiences, “so that 
students’ intuition may be enriched by seeing the wild things that simple non¬ 
linear equations can do.” In this section, we examine the logistic population 
model discussed by May and highlight some of the features of the model’s 
behavior. 

Assume that we have a species whose population from generation to gen¬ 
eration is given by a function F, where F(p) represents the population in the 
next generation resulting from a population of p in the current generation. We 
assume that the population in the next generation is directly proportional to 
two factors: the population in the current generation and the amount of room 
in the environment in which the population can grow. Thus F(p) — kpR(p) 
where R(p) is a function reflecting the room in which the population can grow. 
We assume that the environment can hold a maximum population of M and 
that R(p) is simply given by M — p, the difference between the maximum 
population and the current population. Thus F{p) = kp(M — p). 

To simplify matters, instead of working with the variable p, we work with 
x = p/M, the fraction of the maximum possible population represented by p. 
Consequently, our model takes on the form f(x) — ax(l — x), with domain 
[0,1]. We call this the logistic growth function. We think of the parameter a 
as a growth rate, and we distinguish different functions with different growth 
rates by writing them as fa, rather than /. 

We are only interested in growth rates a such that fa maps [0, 1] back 
into [0, 1], This occurs for a e [0,4]. By the logistic family, we mean for 
the family of logistic functions corresponding to parameter values [0,4]. We 
sketch the graphs of /o, fa, / 2 , fa, and fa in Figure 8.16. For fa the dynamics 
are relatively simple: no matter what the population is in one generation, it 


fo 




f2 



FIGURE 8.16: The logistic functions fa, f\, / 2 , fa, and fa. 
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goes immediately to 0 in the next. For values of a near 0, the dynamics of 
fa are not much different. Specifically, such f a have an asymptotically stable 
fixed point at 0, and every orbit in [0,1] approaches 0 in the limit. Thus, for 
small values of a the growth rate is not large enough to sustain a nonzero 
population, and initial nonzero populations go to 0 in the limit. On the other 
hand, at the other end of our parameter domain where a = 4, we have the 
function fa(x) = 4x( l — x), which we showed to be chaotic in the previous 
section. 

Thus we are left with the question, how do the dynamics of f a change as 
we go from a function fa, where all points immediately map to 0, to a function 
fa that is chaotic? The road to chaos is a fascinating one. Unfortunately, it 
cannot be analyzed, or even described, in just a few pages. We discuss only 
some of the highlights seen along the way. Further details can be found in 
many of the introductory texts on dynamical systems and chaos and in other 
references we will point out. 

To begin, consider Figure 8.17. We call it an orbit diagram; it depicts how 
the expected long-term behavior of the functions fa evolves as a transitions 
from 0 to 4. In the diagram, the horizontal axis represents the parameter a, 
and the vertical axis represents the variable x in the domain, [0, 1], of each of 
the functions fa. The diagram is generated by setting a equal to each of the 
9601 values, 0, ^gg, ^gg, ■■■ , §f§§, 4, and doing the following for each: 

(i) Compute the first 2075 points in the orbit of 0.5 under fa-, that is, 
compute 

/«(0.5), / 2 (0.5)./ a 2075 (0.5). 

(ii) In the diagram, plot the following points corresponding to the last 
75 of the 2075 computed orbit points: 

(a, / 2001 (0.5)),... , (a,/ 2075 (0.5)). 



FIGURE 8.17: The orbit diagram for the logistic family. (Courtesy of Chip Ross.) 
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Thus for each a e [0,4], in the vertical slice at a in the diagram, we are 
seeing an estimate of the long-term behavior of f a . For example, at a = 1, 
just the value 0 is plotted. Therefore for f \, the expected long-term behavior 
is to approach a fixed point at 0. For a = 2, a single point at 0.5 is plotted. 
Thus for /2 the expected long-term behavior is to approach a fixed point at 
0.5. Also, for a = 3.4, values at approximately 0.45 and 0.85 are plotted. 
This reflects that for 73 . 4 , the expected long-term behavior is to approach a 
periodic orbit oscillating between approximately 0.45 and 0.85. Let us verify 
these observations analytically and further explore the behavior depicted in the 
orbit diagram. 

To determine the fixed points for f a , we solve f a (x) = x. We find solutions 
at 0 and at 2=1. Note that 2=1 is outside our domain [0, 1] for a e [0, 1], but 
inside the domain for a e [1,4]. Using Theorem 8.9 to analyze the stability 
of the fixed points, we see that 0 is asymptotically stable for a e [ 0 , 1 ) and 
unstable for a € (1,4], and 2=1 } s asymptotically stable for or e (1,3) and 
unstable for a e (3, 4]. (See Exercise 8.36.) In Figure 8.18, we plot these 
fixed points on an x versus a coordinate system similar to that used in the orbit 
diagram. We use a solid curve to indicate the values where the fixed point is 
asymptotically stable and use a dashed curve to indicate the values where the 
fixed point is unstable. 



FIGURE 8.18: The fixed points of the logistic family and their stability. 


The value of a where the fixed point 0 loses its stability is exactly the value 
of a where the fixed point 2=1 merges with the fixed point at 0 and enters 
our domain. Furthermore, the fixed point 2=1 appears to lose its stability at 
approximately the location where the period -2 orbit comes into our picture 
in the orbit diagram. We explore the birth of the period-2 orbit next. The 
changes in stability we see here are examples of behavior changes known as 
bifurcations. 
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To find period-2 points for f a , we solve f„(x) = x. Along with the fixed 
points of f a , we also find solutions at 

1 + a ± -fa 2 — 2a — 3 
2a 

These are period-2 points, and for each a they form a period-2 orbit. The 
period-2 points are defined for a e (3,4], and they emerge in our domain 
exactly where we thought they would, at the value of a where the fixed point 
—i loses its stability. At a = 3 the fixed point has undergone what 
is known as a period-doubling bifurcation; it has given up its stability to 
an emerging period-2 orbit. If we use Theorem 8.9 to determine the stabil¬ 
ity of these period-2 points, we find that they are asymptotically stable for 
a € (3,1 + V6) and unstable for a e (1 + a/6, 4]. (See Exercise 8.38.) In 
Figure 8.19 we expand Figure 8.18 to include the period-2 points. Note the 
similarity between Figure 8.19 and the orbit diagram over the parameter values 
0 < a < 1 + a/6. 



FIGURE 8.19: The fixed points and period-2 points of the logistic family. 

As we observed, the period-2 points lose their stability at 1 + a/6. What 
happens there? In Figure 8.20 we zoom in on the orbit diagram from Fig¬ 
ure 8.17, considering only the parameter values 3.4 < a < 3.6. Examining 
this diagram, it appears that an asymptotically stable period-4 orbit has been 
born at the point where the period-2 orbit loses its stability. Thus we have had 
another period doubling. It is difficult to verify this analytically by solving 
fa(x) = x, because that involves solving a polynomial of degree 16. How¬ 
ever, the period doubling can be verified by examining the equations /„ Qc) = x 
graphically for values of a near 1 + a/6. (See Exercise 8.40.) 

We are still just at the beginning of the story behind the orbit diagram. As 
the parameter a increases, the period-4 orbit bifurcates to a period-8 orbit that 
bifurcates to a period-16 orbit, and so on. These parameter intervals between 
period doublings get shorter and shorter, and the parameter values associated 
to the period doublings approach a limit L ~ 3.57. 
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0.9 


x 


0.3 

3.4 ol 3.6 

FIGURE 8.20: The orbit diagram for f a with 3.4 < a < 3.6. (Courtesy of Chip Ross.) 

At the period-doubling limit L we have infinitely many periodic orbits, 
and we enter a region of the parameter domain where a variety of interesting 
behaviors are seen. Again, we cannot provide all of the details, but here are 
some of the highlights: 

(i) There are infinitely many parameter values a where f a is chaotic 
on an infinite subset of [0, 1], 

(ii) For each a e [L, 4], f a has infinitely many periodic points. 

(iii) There is an open dense subset P of [L, 4] such that for all a e P, 
f a has exactly one asymptotically stable periodic orbit. For all 
other a € [L, 4], there are no asymptotically stable periodic orbits 
Oifa. 

(iv) For each odd n, there is an open interval of parameter values where 
there is an asymptotically stable period-n orbit. 

(v) Each of the just mentioned period-n orbits arises out of what is 
known as a tangent bifurcation (see Exercise 8.42) and eventually 
loses its stability at the beginning of a cascade of period doublings 
similar to that previously observed. 

In the orbit diagram, we can see “windows” where the asymptotically stable 
orbits with odd period exist and where their subsequent period doublings take 
place. For example, in Figure 8.21 we examine more closely the part of the 
orbit diagram corresponding to 3.8 < a < 3.9. We can see that the period-3 
orbit enters at approximately a = 3.83 and then bifurcates to a period-6 orbit 
at approximately a = 3.84. The birth of the period-3 orbit is addressed further 
in Exercise 8.43. 
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x 


0 

3.8 « 3.9 

FIGURE 8.21: The orbit diagram for f a with 3.8 < a < 3.9, showing the birth of a 
period-3 orbit. (Courtesy of Chip Ross.) 

The logistic family and others like it have been extensively studied to 
uncover the dynamic story behind the orbit diagram. (See [DevR] and [Gil], 
for example.) One fascinating result is that the spectrum of behaviors we have 
described is not unique to the logistic family, but can be seen in other families 
of functions defined over an interval where the family rises through a collection 
of one-humped functions to a function that fills out the interval with chaos. The 
family of functions s a (x) = a sin(7rx), defined on [0,1], with a going from 0 
to 1, is one such family. 

Now, what does all of this say about the population being modeled by the 
logistic equation? A main theme of May’s paper was that even very simple 
functions can exhibit complicated dynamics, and therefore the complex be¬ 
haviors observed in simple natural systems could be inherent to the systems, 
rather than a result of influences external to them. Fixed points, periodic points, 
and chaos are all phenomena to be expected in natural systems, and as param¬ 
eters change in the systems, we can expect exotic transitions between these 
phenomena similar to those observed in the logistic family. 

As the chaos revolution took hold, many mathematicians and scientists 
began to examine systems for the behaviors seen in the logistic family. The 
dynamics in this simple model have helped deepen the understanding of phe¬ 
nomena such as turbulence in fluid flow, motions of celestial bodies, outbreaks 
of diseases, and vibrations in machinery. 

Chaos is not the only source of complicated, unpredictable behavior, but the 
identification of chaos and its associated structures has helped mathematicians 
and scientists better understand the dynamics of nonlinear systems. And, of 
course, topological concepts have played an important role along the way. 

Exercises for Section 8.4 

In these exercises, f a refers to the functions defined by f a (x) — ax(\ — x) with 
a e [0,4], 

8.36. Determine the fixed points of f a , and use Theorem 8.9 to deduce their stability. 
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8.37. Over the domain x e [—1,1] sketch separate web diagrams for /„ with a 
slightly less than 1, equal to 1, and slightly greater than 1. Discuss the changes 
you observe in the fixed points and their stability as the parameter a passes 
through 1. The bifurcation observed here is known as a transcritical bifurca¬ 
tion. 

8.38. Determine the period-2 points of f a , and use Theorem 8.9 to deduce their stabil¬ 
ity. (Hint: Finding the period-2 points involves solving a degree-4 polynomial; 
the fixed points of f a are two of the solutions to the polynomial.) 

8.39. Using a computer graphing program, examine the graphs of the function f 2 
and the line y = x for a slightly less than 3, equal to 3, and slightly greater 
than 3 in order to approximate the fixed points of f 2 . Which fixed points of 
f 2 that you observe are fixed points of f a and which are period-2 points of 
f a ? Discuss the changes you observe as the parameter a passes through 3. The 
bifurcation observed here is known as a period-doubling bifurcation. 

8.40. Using a computer graphing program, examine the graphs of f„ and the line y = 
x in order to approximate the value of a where the period-doubling bifurcation, 
from an asymptotically stable period-2 orbit to an asymptotically stable period-4 
orbit, takes place. 

8.41. Consider the family of functions gx : R -»• R given by gx(x) = Xx — x 3 with 
X e [0, 2]. 

(a) Determine the fixed points and their stability. 

(b) Discuss the bifurcation that takes place over the parameter domain, and 
include illustrations showing how the graph of gx changes (in relation to 
the line y = x) as the bifurcation takes place. This bifurcation is called a 
pitchfork bifurcation. It is a pitchfork bifurcation that occurs at a = 3 in 
fa that causes the first period-doubling bifurcation in the logistic family. 

8.42. Consider the family of functions h\ : R -*■ R given by hx(x) = X + x — x 2 
with X e [—1,1]. 

(a) Determine the fixed points and their stability. 

(b) Discuss the bifurcation that takes place over the parameter domain, and 
include illustrations showing how the graph of hx changes (in relation to 
the line y = x) as the bifurcation takes place. This bifurcation is called a 
tangent bifurcation. Explain the name. 

8.43. Using a computer graphing program, examine the graphs of f 2 and the line 
y = x in order to approximate the value of a where the tangent bifurcation 
takes place that gives rise to the period-3 window shown in Figure 8.21. 

8.5 Chaos Implies Sensitive Dependence on Initial Conditions 

In this section we prove that the metric notion of sensitive dependence on initial 
conditions is a consequence of the topological definition of chaos. Specifically, 
we have the following theorem: 

THEOREM 8.19. Let X be an infinite metric space and f : X -> X be 
continuous and chaotic. Then f has sensitive dependence on initial conditions. 

This result was proven by John Banks, Jeff Brooks, Grant Cairns, Gary 
Davis, and Peter Stacey in 1992 in [Ban]. In some of the first formulations 
of chaos, sensitive dependence on initial conditions was included among the 
requirements for a function to be chaotic. (For example, see [DevR].) The 



288 


Chapter 8 Dynamical Systems and Chaos 


realization that sensitive dependence on initial conditions is a consequence of 
the two topological requirements for chaos made it possible to formulate a 
definition of chaos solely in topological terms. 

Proof. Let X be an infinite metric space with metric d, and assume that 
/ : X -* X is continuous and chaotic. We show that there exists a S > 0 
such that for all x € X and s > 0, there exist y e Bd(x, s) and n e Z+ 
such that d(f n (x), f n (y )) > <$. 

To begin, choose two periodic points q and q' whose orbits are dis¬ 
joint. Let S 0 be the minimum distance between the orbits (that is, Sq is 
the minimum distance between any two points, one on the orbit of q, 
the other on the orbit of q'). Since the orbits are disjoint, <5o > 0. Set 
S = <5o/8. We claim that S satisfies the requirements of the theorem. 
Thus assume that x € X is arbitrary and s > 0. Without loss of general¬ 
ity, we may assume that s < S. We show that there exist y e Bd(x, s) 
and n e Z + such that d(f n (x), f n (y)) > S. 

Since periodic points of / are dense, there exists a periodic point p 
in B ( i(x , e). Assume that p has period m. 

Either the distance from x to every point in the orbit of q is greater 
than <$o/2 or the distance from x to every point in the orbit of q' is 
greater than Sq/2. Otherwise, by the triangle inequality, there is a pair 
of points—one in the orbit of q and the other in the orbit of q' —whose 
distance apart is less than Sq, contradicting the definition of So. Without 
loss of generality, assume that the distance from x to every point in the 
orbit of q is greater than Sq/2 = AS. 

Let Bo be the open ball about q of radius 8, and for each j = 
1,... , m, let Bj be the open ball about f J (q) of radius S. By the conti¬ 
nuity of each f J , the sets (f J )~' (Bj) are open sets. Furthermore, each 
such set contains q. Set 


Wj . 

The set V is an open set containing q. Note that the m used here is the 
period of p and is not necessarily the period of q. The idea behind the 
construction of V is to find a neighborhood of q such that for every v in 
the neighborhood, the points f 1 (v),... , f m (v), obtained from the first 
m iterations of / on v, are all within S of the orbit of q. The reason for 
needing this will become clear shortly. 

By topological transitivity there exist w e Bd(x,s) and k e Z+ such 
that f k (w) € V. The integers k and k + maxem apart, so there exists h e 
Z + suchthatfc < hm < k+m. Weclaimthat<i(/ ftm (/?), f hm (w)) > 28. 
This implies that the distance between f hm (x) and either f hm (p) or 
f hm (w) must be greater than S, and since both p and w are in Bd(x, s), 
this shows that S satisfies the requirements of the theorem. We return to 
this point in a moment. 


v = B 0 n p| ((/'') 


-l 
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To prove the claim that d(f hm (p), f hm (w)) > 25, first note that 
f hm (p) = p, so we show that d(p, f hm (w)) > 28. Also note that 
since f k (w) € V, the next m iterations of / on w are guaranteed (as 
we indicated earlier) to all lie within 8 of the orbit of q. In particular, 
f hm (w) € Bhm-k, and therefore d(f hm (w), f hm ~ k (q)) < 8. Now, by 
the triangle inequality, 

d(x, f hm ~ k {q)) < d(x, p ) + d(p, f hm (w )) + d(f hm (w), f hm ~ k (q )) 
< 8+ d(p, f hm (w))+S 
= 28 + d(p,f hm (w)). 

Since the distance between x and the orbit of q is greater than 45, it 
follows that 45 < d(x, f hm ~ k (q)). Therefore45 < 25 + d(p, f hm (w)), 
implying that dip, f hm (w )) > 25. Thus, we have established the claim 
thatd(f hm (p), f hm (w)) > 25. 

Now, let n = hm. Either 

d{f n {x), f n (w)) > 8 or d(f n {x), f n (p )) > 5, 

otherwise we would have 

d(f n (x), f n (w)) < 8 and d(f n (x), f n (p)) < 5, 

which, by the triangle inequality, would imply that d(f n (p), f n {w)) < 
25, a contradiction. Since both w and p lie in Bd{x , e), there exists y 
in Bd(x, e ) and n e Z+ such that d(f n (x ), /"(y)) > 5, completing the 
proof of Theorem 8.19. ■ 


Following the results of Banks and his colleagues, others explored further 
relationships between the existence of a dense set of periodic points, topo¬ 
logical transitivity, and sensitive dependence on initial conditions. A number 
of interesting results emerged. For example, in 1994 Michel Vellekoop and 
Raoul Berglund showed in [Vel] that for /, an interval in R, if /:/->/ is 
continuous and has topological transitivity, then / has a dense set of periodic 
points and therefore is chaotic. Thus, for a continuous function on an interval 
in R, topological transitivity suffices to assert the existence of chaos. 

Also, in 1997 PatTouhey showed in [Tou] that for continuous f : X —> X, 
the two conditions for chaos in Definition 8.10 can be consolidated into one 
necessary condition via the following theorem: 


THEOREM 8.20. A continuous function f : X -* X is chaotic if and only if 
for every pair of open sets U and V in X there is a periodic point x € U and 
ak € Z+ such that f k {x) e V. 


Proof. See Exercise 8.47. 
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The condition for a function / to be chaotic, specified in Theorem 8.20, 
states that every pair of open sets in the domain of / is visited by some periodic 
orbit of /. Touhey further showed that given a finite collection of open sets 
in the domain of a chaotic function /, there is a periodic orbit of / that visits 
each of the open sets. 

Although Theorem 8.20 allows for a simplification of the definition of 
chaos to one necessary condition (in the case of a continuous function), the 
definition with two necessary conditions (Definition 8.10) is particularly re¬ 
vealing because it highlights the dyadic nature of chaos: the regularity present 
in the dense collection of periodic points and the irregularity resulting from 
topological transitivity. 

Exercises for Section 8.5 

8.44. Here we show that neither topological transitivity nor the existence of a dense 
set of periodic points is by itself enough to imply sensitive dependence on initial 
conditions. 

(a) Find a function / : X -» X that have a dense set of periodic points in its 
domain but does not have sensitive dependence on initial conditions. 

(b) Let / : S i -*■ S l be defined by f{9) =0 + 1; that is, / is rotation of the 
circle by one radian. Prove that / is topologically transitive but does not 
have sensitive dependence on initial conditions. (Hint: Given an interval 
U in the circle, prove that there exists n e Z+ such that f n (U) n U ^ 0, 
but f n (U) # U. Then consider the sets f mn (JJ) form e Z + .) 

8.45. In this problem, we demonstrate that sensitive dependence on initial conditions 
is not preserved under topological conjugacy. Let / : (1, oo) ->• (1, oo) be 
defined by f(x ) = 2 jc, and let g : R+ —► R+ be defined by g(x) = l + x. 

(a) Prove that / has sensitive dependence on initial conditions, but g does not. 

(b) By finding an explicit homeomorphism h : (1, oo) —► K + , prove that / 
and g are topologically conjugate. 

8.46. Exercise 8.45 demonstrates that sensitive dependence on initial conditions is 
not generally preserved under topological conjugacy, but, as we show in this 
exercise, if we restrict to a compact domain, then it is. 

Let X and Y be compact metric spaces with metrics dx and dy, respectively. 
Assume that / : X —> X has sensitive dependence on initial conditions with 
sensitivity constant S, and assume that g : Y -*■ Y is topologically conjugate 
to / via a homeomorphism h : X -*■ Y. 

(a) Prove that C = {(yi,y 2 > e Y x Y \ d x {h~ l {y\), h~\y2)) > 5} is a 
compact subset of Y x Y. 

(b) Prove that dy (yi , y 2 ) > 0 for all (yi , y2 ) e C and therefore dy takes on a 
minimum positive value S* on C. 

(c) Prove that g has sensitive dependence on initial conditions with sensitivity 
constant 5*. 

8.47. Prove Theorem 8.20: A continuous function / : X —► X is chaotic if and 
only if for every pair of open sets U and V in X there is a periodic point x e U 
and a k e Z+ such that f k (x) e V. 



Chapter 9 


Homotopy and Degree Theory 


Although we have discussed how we might compare two different topological 
spaces, we have not talked about how we might compare two functions. Is there 
a useful equivalence for functions? In Section 9.1, we introduce the concept 
of homotopy, a continuous deformation of one function to another. Properties 
shared by two functions that are homotopic to each other play an important 
role in the results we present in this chapter. 

In Section 9.2, we apply homotopy to functions we call circle functions. 
The circle is a natural space to use when modeling situations where a variable 
is periodic that is, repeating the same behavior over successive time intervals of 
a fixed length. Circle functions arise in situations where one periodic variable 
is being mapped to another. For example, when a beating heart undergoes a 
stimulus, the time in the beat cycle at which the stimulus occurs is mapped to 
the time in the beat cycle when the heart recovers from the stimulus and beats 
again. 

Circle functions are continuous functions mapping the circle to itself. For 
such functions we introduce the notion of degree, a quantity that measures how 
many times the function wraps the circle around itself. It is a straightforward 
consequence of our definition of degree that two circle functions are homotopic 
if and only if they have the same degree. We define degree in Section 9.2, and 
in Section 9.6 we prove some technical results behind the definition of degree. 

In Section 9.2, we also introduce retractions and use degree to prove the 
Two-Dimensional No Retraction Theorem (a result that we use in Chapter 10 
to prove the Two-Dimensional Brouwer Fixed Point Theorem). In Section 9.3 
we present an application of degree to a heartbeat model. There, with simple 
assumptions about a beating heart, we draw the conclusion that the heart cannot 
respond continuously to a stimulus applied at varying strengths and times in 
its beat cycle. We discuss the surprising conclusion that a relatively small 
stimulus to the heart at just the right moment in the beat cycle may throw the 
heart into fibrillation. 

In this chapter, we also consider some applications of degree in mathe¬ 
matics. In Section 9.4 we use degree to prove the Fundamental Theorem of 
Algebra. In Section 9.5 we use degree to prove that R 2 — {0} and S 1 are not 
simply connected, then use those results to distinguish the plane from R”, for 
n > 3, and to distinguish the circle from S n , for n >2. 

9.1 Homotopy 

Here we introduce homotopy, a concept that makes precise what it means to 
deform one function to another. 
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DEFINITION 9.1. Let f, g : X —► Y be continuous functions. Assume 
that I = [0, 1] has the subspace topology it inherits from R and that X x / 
has the product topology. We say that f and g are homotopic if there ex¬ 
ists a continuous function F : X x / —► Y such that Fix, 0) = fix) and 
F{x, 1) = g(x). Such a function F is called a homotopy between f and g. 
The expression f — g denotes that f and g are homotopic. 

Figure 9.1 depicts a homotopy F. With X x I having the product topology, 
we think of a homotopy as an interval, I, of maps from X to Y. For a fixed 
value t in I, the function F|x x (/j is a map from X to Y. As the values for 
t vary over the interval I, the map varies continuously over a one-parameter 
family of continuous maps, starting with / and ending with g. 


1 


0 


FIGURE 9.1: A homotopy deforms map / to map g. 



EXAMPLE 9.1. Define F : R x / -> R by Fix, t) = x +1. The function 
F is continuous since addition is continuous. So F is a homotopy between 
fix ) = Fix, 0) = x and g(x) = F(x, 1) = x + 1. The map / is the identity 
map on R, sending each point in R to itself. The map g translates the entire 
real line one unit in the positive direction. For a fixed value of t, the homotopy 
Fix, t ) translates the real line a distance t. As t varies from 0 to 1, F takes us 
from / to g. 


EXAMPLE 9.2. Define / : / -» R 2 by /(x) = (cos( 7 rx), — sin( 7 rx)) and 
g : / -> R 2 by g(x) = (cos( 7 rx), sin( 7 rx)). Both of these maps are examples 
of paths. 

They are homotopic, but how do we see the homotopy? We take a straight- 
line homotopy between them. We call it a straight-line homotopy since for 
each value of x 6 I, we deform fix) to g(x) along the line segment between 
them. (See Figure 9.2.) During the homotopy, at each t between 0 and 1, we 
have moved that percentage of the way from fix) to g(x). 

Define F : / x / -*■ R 2 by F(x, t) = (cos(jtx), (1 — 2 1) sin(^x)). The 
function F (x, t ) is continuous since it is made up of compositions and products 
of continuous functions. Furthermore, F(x, 0) = fix) and F(x, 1) = g(x), 
and therefore F is the desired homotopy. 
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FIGURE 9.2: A homotopy in R 2 from path / to path g. 


In the previous example, if we considered the two paths / and g to have 
range R 2 — {O}, then the straight-line homotopy between them is no longer 
valid, since the function F, as defined, does not send I x / to R 2 — {0}. 
However, even though F does not work as a homotopy, the two paths are 
homotopic in R 2 — {O}. (See Exercise 9.4.) 


THEOREM 9.2. The relation ~ is an equivalence relation on the set of all 
continuous functions f : X —► Y. 

Proof. We must show that the relation is reflexive, symmetric, and tran¬ 
sitive. To show that ~ is reflexive, let / : X —»• Y be a continuous 
function. Define a homotopy F : X x / -* Y by Fix, t ) = fix). Then 
Fix, 0) = f{x) and Fix, 1) = fix), so / ~ /. 

That ~ is symmetric follows from the fact that if Fix, t) is a ho¬ 
motopy between / and g, then G : X x I —> Y, defined by Gix, t) = 
Fix, 1 — t), is a homotopy between g and /. So / ~ g implies g — f. 

To prove transitivity, let /, g, and h be continuous maps from X 
to Y. Suppose that / is homotopic to g via the homotopy F and g is 
homotopic to h via the homotopy G. Then, as illustrated in Figure 9.3, 
define a function H : X x / — x Y by 

FKx t\=\ F{x,2t) for 0 < t < 5 , 

1 ’ |G(x, 2t — 1) for j < r < 1. 

For t = 1/2 the expressions Fix, 21) and Gix, 2t — 1) both equal gix), 
and hence they agree on the set where they are both used in the definition. 
It follows that H is continuous by the Pasting Lemma. Since H{x, 0) = 
fix) and H(x, 1) = hix), we have proved that / ~ g and g — h imply 
f-h. ■ 
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FIGURE 9.3: Forming H from F and G. 

Since an equivalence relation on a set yields a partition of the set into 
equivalence classes, the set C(X, Y ) of all continuous functions / : X -> Y 
is partitioned into equivalence classes under the relation ~. Consequently, we 
have the following definition: 

DEFINITION 9.3. Let C (X, Y) denote the set of all continuous functions 
f : X —»■ Y. Define the homotopy classes in C(X, Y) to be the equivalence 
classes under the relation ~. The homotopy class containing a function f is 
denoted [/]. 


EXAMPLE 9.3. Every continuous map / from R to R is homotopic to the 
trivial map go : R -» R given by go(x) = 0. The function F(x, t ) = 
(1 — t)f(x) is a homotopy between / and go- It follows that the collection 
C(R, R) consists of the single homotopy class [go]. 


Exercises for Section 9.1 

9.1. Show that the identity map from the disk D in the plane to itself is homotopic 
to the map that takes D to the origin. 

9.2. Show that if f\, fa : X -*■ Y are homotopic and g\, g 2 : Y —> Z are homotopic, 
then g 2 o /2 is homotopic to g\° f\. 

9.3. Let / and g be paths in R. Show that / is homotopic to g. 

9.4. Let / and g be paths in R 2 — { O}. Show that / is homotopic to g. (Hint: Show 
that every path is homotopic to the constant path that sends the entire interval 
to the path’s starting point. Then show that two constant paths are homotopic 
using the fact that R 2 — {O) is path connected.) 

9.5. Let f : I -*■ Y and g : / -» Y be paths in Y such that /(0) = g(0) = yo 
and /(1) = g(l) = y\ . (Therefore / and g have the same initial point and the 
same endpoint.) A path homotopy from / to g defines a continuous function 
F : I x / —> Y such that F(x, 0) = f{x), F(x, 1) = g{x), F( 0, t ) = yo, and 
F(l,f) = yi. 

(a) Prove that path homotopy defines an equivalence relation on the set of all 
paths in T that share the same initial point and the same endpoint. 
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(b) Show that every path p : / -> M beginning at 0 and ending at n is path 
homotopic to the direct path in R from 0 ton, that is, to the path : / —> M 
given by p n (x ) = nx. 

9.6. Show that if X is a topological space, and D is the disk in the plane, then there 
is only one homotopy class of continuous functions from X to D. 

9.7. Consider the following definition: 

DEFINITION 9.4. A topological space X is said to be contractible if the 
identity function on X is homotopic to a constant function. 

(a) Prove that contractibility is a topological invariant. That is, prove that if 
X and Y are homeomorphic, then X is contractible if and only if Y is 
contractible. 

(b) Prove that R" is contractible. 

(c) Let X be a contractible space. Prove that X is path connected. 

9.8. Let Y have n path components. Prove that there are n homotopy classes of 
paths in C(7, T). 

9.2 Circle Functions, Degree, and Retractions 

Our primary focus in this section is on continuous functions / : S 1 -*■ S l . 
Such functions are called circle functions. Even though we focus on the circle 
S 1 , the results presented here carry over to every space X that is homeomorphic 
to the circle. We define the degree of a circle function, and then use degree to 
prove the Two-Dimensional No Retraction Theorem, which states that there is 
no continuous function that takes the disk D in the plane to its circle boundary 
S 1 while fixing S 1 . 

To represent the points on the circle, we use the variable 9, where 8 is 
the usual angular measure taken from the positive x-axis in the plane. (See 
Figure 9.4.) We assume two values 0\ and 62 represent the same point on the 
circle if they differ by an integer multiple of In. 



FIGURE 9.4: We represent the points on the circle with the variable 9. 

To visualize the behavior of circle functions, we can display their graphs. 
We think of the graph of a function / : S 1 S 1 as the set 

{(x,y) eS l xS l \ y = f(x)}. 

The graph is a subset of the torus, and if we consider the torus as a square 
[0, 2 n] x [0,2 n ] with opposite edges identified, then we can exhibit the graphs 
of circle functions as subsets of [0, 2 n] x [0, 2 n] (with it understood that the 
opposite edges are identified). For example, in Figure 9.5 we display the graphs 
of f(9) = f, g(8) = 8 + f, h(8) = 28, and k(8) = -8. 
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f 8 h k 


FIGURE 9.5: Graphs of the circle functions /, g, h, and k. 

Our goal is to make precise the notion of degree, a measure of how many 
times a circle function / : S 1 -»■ S 1 wraps the circle around itself. We would 
like this measure to be equal to 1 for the identity function on the circle, equal 
to —1 for the function f (8) — — 6 , and—more generally—equal to n for the 
function c n (0) = nd. (See Figure 9.6.) 



FIGURE 9.6: The functions c„ wrap n times around the circle. 

A circle map, not quite as nice as c n (6) = nd, might oscillate as it wraps 
around the circle. (See Figure 9.7.) Or it might wind a few times around the 
circle and then backtrack a few times. It could also pause for a while as it 
wraps around the circle. Or it might possess a variety of these and other types 
of complicated behaviors. 



FIGURE 9.7: Oscillations, backtracks, and pauses are some of the complicated 
behaviors we may observe in circle maps. 

The goal behind defining the degree of a circle map is to deform away all 
of the complicated behavior via homotopy. Fortunately, we can do this as a 
result of the following theorem: 

THEOREM 9.5. For each circle function, f : S l —>■ S 1 , there exists a unique 
n € Z such that f is homotopic to c n (0) = n8. 
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DEFINITION 9.6. The unique n associated to f : S 1 —> S l in Theorem 9.5 
is defined to be the degree of f and is denoted deg( f). 

We prove Theorem 9.5 in Section 9.6. From the definition, we obtain 
the following theorem, which indicates that degree determines the homotopy 
classes of circle functions: 

THEOREM 9.7. Two circle functions f,g:S l —>S l are homotopic if and 
only ifdeg(f) = deg(g). 

Proof. Since homotopy is an equivalence relation, the distinct homotopy 
classes of circle maps are disjoint sets. Theorem 9.5 implies that they 
are given by {[c„(0)] | n e Z}. Hence / ~ g if and only if [/] = [g] = 
[c n (0)]. This occurs exactly when deg(/) = deg(g) = n. ■ 

The following corollary of Theorem 9.7 is used in our application of degree 
to a heartbeat model in the next section: 

COROLLARY 9.8. Let F : S l x [a, b] -» S 1 be continuous. If for each r 6 
[a, b] we let F r : S l S l be the circle function defined by F r (9) = F(6, r), 
then the degree of F r is independent of r. 

Proof See Exercise 9.10. ■ 

Corollary 9.8 indicates that if we have a continuous function F mapping 
an annulus to the circle, then, as illustrated in Figure 9.8, if we restrict F to a 
circle of radius r, concentric with the annulus, then the degree of the resulting 
circle function F r does not depend on r. 



FIGURE 9.8: The degree of the restriction F r does not depend on the radius r. 

The next theorem is an extension theorem concerning circle maps with 
degree 0 ; it will be of use to us throughout the chapter. 

THEOREM 9.9. A circle function f : S 1 —► S 1 has degree 0 if and only if f 
extends to a continuous function on the disk D (that is, if and only if there exists 
a continuous function F : D -»• S 1 such that F(x) = f(x) for all x € S l ). 
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Proof. Throughout the proof, we represent the points in the disk with 
polar coordinates (r, 6). 

First, assume that / extends to a continuous function F : D S l . 
Define a function G : S 1 x / —> S 1 by G(8, t) = F(t,9). Since F 
is continuous, so is G. Therefore G is a homotopy between the circle 
functions G| 5 i X { 0 j and Gj s i implying that these functions have the 
same degree. The function G| s i X { 0 } is given by 

G| s i x{O ,(0) = G(0, 0 ) = F(0, 0) = F( 0 , 0 ). 

It follows that G| s i X { 0 ) is a constant function and therefore has degree 
0. Thus, G|$i x{1 } also has degree 0. It is straightforward to see that 
G| s » x {(} is equal to /, implying that / has degree 0, as desired. 

Now assume that / has degree 0. Therefore there exists a homotopy 
G : S 1 x I -> S 1 such that G(9 ,0) = co(9) and G(9, 1) = f(9), where 
c'o : S 1 —»• S 1 is the constant function sending each point 9 e S 1 to the 
point 0 € S l . Define F : D -*■ 5 1 by F(r, 9) = G(9, r). Since G(9, r) 
is constant in 9 when r = 0, it follows that F is well defined at r = 0 and 
therefore is defined as a function on D. Furthermore, G being continuous 
implies that F is as well. Finally, F(1,0) = G(9, 1) = f(9), implying 
that F is an extension of /. ■ 

The following result is another useful fact that can be determined from the 
degree of a circle function: 

THEOREM 9.10. If a circle function f : S 1 -*■ S l has a nonzero degree, 
then f is surjective. 

Proof. See Exercise 9.11. ■ 

We now introduce the concept of a retraction, which is a continuous func¬ 
tion that maps a space onto a subspace while fixing the subspace. 

DEFINITION 9.11. Let X be a topological space and A be a subset of X. 
A retraction from X onto A is a continuous function r : X -» A such that 
r{a) = a for every a € A. If there exists a retraction r : X —> A, then A is 
said to be a retract ofX. (See Figure 9.9.) 



-r(x) g A 


r(a) = a 



FIGURE 9.9: The subset A is a retract of X. 
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EXAMPLE 9.4. If A is a one-point subset {a} of a topological space X, then 
A is a retract of X since the function mapping each x in X to a is a retraction. 


-i 

EXAMPLE 9.5. The circle S 1 is a retract of R 2 — {O}. Using (r, 0) polar 
coordinates, the function / : R 2 — { O } — > S 1 , defined by /(r, 0) = (1, 9), is 
a retraction. 

_i 


EXAMPLE 9.6. Consider the annular region A in the plane, shown in Fig¬ 
ure 9.10. It is the region centered at the origin, running from the circle C\ of 
radius 1 to the circle C 2 of radius 2, inclusive. Individually, C\ and C 2 are 
retracts of A. However, together C\ U C 2 is not a retract of A because the 
continuous image of a connected space must be connected. 



FIGURE 9.10: Individually, C 1 and C 2 are retracts of the annulus, but Ci UC 2 is not. 


j 


In the previous example, the fact that the circles C\ and C 2 are retracts 
of the annulus A is a special case of the next example, which shows that the 
spaces X and Y can naturally be seen as retracts of the product space X x Y. 


EXAMPLE 9.7. Let topological spaces X and Y be given, and consider the 
product space X x Y. Assume x* € X. Then the subset A = {x*} x Y of 
X x Y is a retract via the retraction r : X x T -* A given by r(x, y) = (x*, y). 
Therefore, the product space X x Y contains subspaces homeomorphic to Y 
that are retracts (and similarly contains subspaces homeomorphic to X that are 
retracts). 

For example, within the torus there are circle subspaces that are retracts of 
the torus. (See Figure 9.11.) 
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Now, consider the disk D in the plane. We can believe that there is no 
retraction from D to its circle boundary, just as we might expect that it is not 
possible to deform the entire skin of a drum to its rim, keeping the points on 
the rim fixed, but not tearing the skin. (See Figure 9.12.) 



FIGURE 9.12: Deforming the skin of a drum to the rim will tear it, demonstrating 
that there is no retraction from D onto S 1 . 

In fact, there is no retraction of the n-ball B n in R" onto its (n — l)-sphere 
boundary, S" -1 . This general result is known as the No Retraction Theorem. 
Here we prove the one- and two-dimensional versions of the No Retraction 
Theorem. With the use of appropriate tools from algebraic topology, the general 
result is straightforward. 

THEOREM 9.12. The One-Dimensional No Retraction Theorem. 

There is no retraction from B l = [—1,1] onto its sphere boundary, S° = 

{- 1 , 1 }. 


Proof. That there is no retraction from [—1, 1 ] onto {—1, 1} is immediate 
since a retraction from [—1,1] onto {—1,1} would be a continuous func¬ 
tion from a connected space onto a disconnected one, an impossibility. 


THEOREM 9.13. The Two-Dimensional No Retraction Theorem. 

There is no retraction fmm the disk D onto its circle boundary, S l . 
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Proof. We prove this by contradiction. Thus assume that there exists a 
continuous function F : D -> S 1 such that F(6) = 6 for all 6 e S l . 
Such a function F is a continuous extension of the identity function 
id : S 1 -> S 1 , defined by id(6) = 0. The function id has degree 1, and 
therefore Theorem 9.9 implies that id cannot be extended to a continuous 
function defined on D, a contradiction. Thus there is no retraction from 
DontoS 1 . ■ 


EXAMPLE 9.8. Theorem 9.13 implies that there is no retraction from R 2 
onto S 1 , because such a retraction, when restricted to D, would then provide a 
retraction from D onto S 1 . 


We revisit the No Retraction Theorem in Chapter 10, where we show that 
it is equivalent to the Brouwer Fixed Point Theorem. 

Next we prove a general existence theorem about retractions. Even though 
there is no retraction from the disk D onto S 1 , if A c D is an arc, then no 
matter how much A winds around in the disk, A is a retract of the disk. (See 
Figure 9.13.) The proof of this result uses the Tietze Extension Theorem, 
which was established in a set of supplementary exercises in Section 7.3. 



FIGURE 9.13: Every arc A in D is a retract of D. 


THEOREM 9.14. Let D be the disk, and let A C D be an arc. Then A is a 
retract of D. 

Proof. Since A is an arc, there is an embedding / : [—1,1] -» D 
mapping onto A. The interval [—1, 1] is compact, and therefore so is 
A, implying that A is a closed subset of the Hausdorff space D. By 
the Tietze Extension Theorem, the function / -1 : A -» [—1,1] has a 
continuous extension g : D -> [— 1, 1]. The function / o g : D —► A is 
a retraction from D onto A. ■ 

Exercises for Section 9.2 

9.9. Determine the degree of each of the following circle functions: 

(a) The antipodal map, A : S l —> S 1 , A(6) = 6 + n, that maps each point 
on the circle to the point opposite it through the center. 
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(b) A function that wraps the circle around itself once in one direction and 
then back around once in the other direction, 


f-S 


1 


s',/(0) = 


29 

-29 


for 9 e [0, it], 
for 9 € [7T, 27T]. 


9.10. Prove Corollary 9.8: Let F : S ' 1 x [a,b] -> S 1 be continuous. If for each r g 
[a, b] we let F r : S 1 —>• S l be the circle function defined by F r (9) = F(9, r ), 
then the degree of F r is independent of r. 

9.11. Prove Theorem 9.10: If a circle function / : S 1 S 1 has a nonzero degree, 
then / is surjective. (Hint: Prove the contrapositive.) 

9.12. Let X be a connected Hausdorff space and B be a two-point subset of X. Prove 
that B is not a retract of X. 

9.13. Let X be a Hausdorff space and A be a retract of X. Prove that A is a closed 
subset of X. 

9.14. Illustrate how the infinite spiral S in Figure 9.14 is a retract of the plane. 



FIGURE 9.14: The spiral S is a retract of the plane. 


9.3 An Application to a Heartbeat Model 

In this section we examine a model for stimulating a beating heart. We use de¬ 
gree theory to draw some interesting conclusions about how the heart responds 
to stimuli of varied strengths, applied at different times during the beat cycle. 
This model is adapted from a paper, “Sudden Cardiac Death: A Problem in 
Topology,” by Arthur Winfree (1942-2002), a theoretical biologist who was 
well regarded for his work on mathematical modeling of biological systems. 

We model the time in the beat cycle by a variable 9 on a circle (see Fig¬ 
ure 9.15), and we assume that the heart beats at 6 = 0. We can think of a clock 
hand rotating around the circle, and each time it passes by 6 = 0 , the heart 
beats. Notice that this means we are assuming a beat interval of 2n , for our 
convenience. 

A stimulus is applied to the heart during its beat cycle and we wish to 
examine the response of the heart to the stimulus. The response is measured 
as the time it takes the heart to beat again after the stimulus is applied; this 
variable is known as the latency and is denoted L. 
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FIGURE 9.15: Time in the beat cycle is modeled by a circular variable 0. 

The time in the beat cycle at which the stimulus is applied is called the 
coupling interval; we denote it by c. We consider stimuli of varying strengths 
5 from weak (s = w) to strong (s = g ) and examine L(c, s), the latency as a 
function of coupling interval and stimulus strength. Thus, we have a function 
L : S 1 x [u>, g] S 1 . 

The heart’s response to a weak stimulus is called weak rescheduling, and 
its response to a strong stimulus is called strong rescheduling. We make the 
following reasonable assumptions about weak and strong rescheduling: 

(i) With weak rescheduling, the stimulus is too weak to result in any 
appreciable change in the beat timing, and therefore the latency is 
the time remaining in the beat cycle after the stimulus. 

(ii) With strong rescheduling, the stimulus is strong enough so that 
the heart loses memory of the past, resulting in a latency that is 
independent of the coupling interval. 

These assumptions translate to the following assumed properties of the 
latency L, as illustrated in Figure 9.16: 

(i) L(c, w) = 2rt — c for all c e S 1 . 

(ii) There exists 9* e S 1 such that L(c, g) = 9* for all c e 5*. 



FIGURE 9.16: Latency L(c , s) defined over the domain S 1 x [w, g]. 
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If we restrict L to a particular stimulus strength s*, then the result is a 
circle function L| s i x(i »). If £|si*{.s*) is continuous, then it has a well-defined 
degree. 

The function L| s i x(u; | has degree —1 since it wraps S 1 x {in} around S’ 1 
once in the clockwise direction. Furthermore, L| 5 i X (£) has degree 0 since it 
is a constant function. But Corollary 9.8 implies that if L is continuous, then 
the degree of ^-l^i X { j*} i s independent of s*. Hence, L : S ] x [u>, g] —>■ S 1 
is not continuous; that is, given our assumptions, latency does not depend 
continuously on stimulus timing and strength over a strength interval varying 
from weak to strong. 

Thus, there are discontinuities in the heart’s response to varying stimulus 
timing and strength. If the heart receives a stimulus at the appropriate time 
in the beat cycle, that stimulus can disrupt the beat pattern of the heart in an 
unpredictable way. 

Now, let us examine L : S l x [w, g] -> S 1 further to extract possible 
additional topological consequences of our assumptions. Since L is not con¬ 
tinuous, there is a set of points P c S 1 x [in, g] where L fails to be continuous. 
We consider the situation where this set of discontinuities lies in the interior 
of a disk K in S 1 x [in, g]. (See Figure 9.17.) For example, this occurs if 
there is a single point of discontinuity of L and that point is in the interior of 
S 1 x [in, g]. We claim that on the circle that is the boundary of K, the latency 
function L takes on all values of latencies from 0 to 2n. 



FIGURE 9.17: The set P, of discontinuities of L, lies in K. 


To prove this claim, we make some changes to L and its domain. First, note 
that if A is the set obtained by removing the interior of K from S ] x [in, g], then 
L\a, by assumption, is continuous. To see the next stage of our construction, 
we pictorially flip A so that the inner circle s = in is on the outside and the 
outer circle s = g is on the inside. (See Figure 9.18.) 

Next, we fill in the inner-diameter hole to obtain a new domain A'. Fur¬ 
thermore, we extend L to a continuous function L' by setting L' equal to the 
constant 9* on the filled-in portion of A'. (See Figure 9.19.) Since L' was al¬ 
ready equal to 9* on the inner boundary of A, it follows that L' is a continuous 
function. (Note that if L| s i X { g j was not necessarily constant, but had degree 
0, then by Theorem 9.9, we could still extend L\a to a continuous function L' 
on A'.) 
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FIGURE 9.18: The domain A turned inside out. 



FIGURE 9.19: Extending L on A to L' defined on A'. 

The domain A' is topologically an annulus. Therefore we can apply Corol¬ 
lary 9.8 to conclude that the degree of L'\qk equals the degree of L'\ s=w . As 
we discussed previously, the degree of L'\ s=w is equal to —1. Theorem 9.10 
implies that if we consider L'\qk as a function from the circle to itself, then it 
is surjective. Therefore, as claimed, on the circle that is the boundary of K, 
the function L takes on all values of latencies from 0 to 2n. 

Thus, we see that a quite complicated distribution of responses can arise 
near the set of discontinuities of L. In particular, if L has a single point 
of discontinuity p, and that point is interior to S l x [in, g], then on every 
circle surrounding p, the function L takes on all possible latency values. In 
such a situation, it follows that with very small variations of stimulus strength 
and timing away from p, the whole spectrum of latency responses can be 
experienced. We cannot predict how long it will take in the beat cycle before 
the heart beats again. Anything can happen. 

The topological properties of the model presented here provide possible 
explanations for the cause of fibrillation, an often fatal disorganization of the 
heart’s contraction pattern. George Mines, a McGill University experimental 
researcher of fibrillation, met an unfortunate death at the age of 28 in 1914 
when he chose to be his own experimental subject. The laboratory janitor 
discovered Mines, unconscious, with the experimental apparatus attached to 
his chest, the heartbeat monitor recording his failing heartbeat. Mines died 
later that evening. In his final paper, Mines had hypothesized that fibrillation 
can be triggered by an untimely electrical impulse. The topological properties 
of the model just presented, as well as the fatal results of Mines’s experiment, 
certainly support his conjecture. 
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Exercises for Section 9.3 

9.15. This exercise provides an alternative approach to showing that the latency func¬ 
tion L : S l x [to, g] -*■ S 1 is not continuous. Assume that L is continuous, and 
from it construct a continuous function on the disk in the plane, / : D —* S 1 , 
such that the restriction to the boundary circle,/| s i : S 1 -» S 1 , has degree-1. 
Use Theorem 9.9 to derive a contradiction. 


9.4 The Fundamental Theorem of Algebra 

We have previously seen the Intermediate Value Theorem and the Extreme 
Value Theorem, two important applications of topology in analysis. In this 
section, we show how degree can be used to prove an important basic theorem 
in algebra, the Fundamental Theorem of Algebra. 

The Fundamental Theorem of Algebra asserts that every polynomial equa¬ 
tion 


a n z n +a„-iz n 1 + ... +a\z + ao = 0 (9.1) 

has a solution within the complex plane, C. It is surprising that we can prove 
such a fundamental algebraic result using the tools of topology. 

First, note that since we can divide Equation 9.1 through by a n , we only 
need to consider equations of the form 

z n + a n -\z n ~ [ + ... + a\z + ao = 0. 

This equation has a solution in C if and only if for each positive real number 
c, the equation 


z" + —z"- 1 + ... + ■ 


a l 


T z + ~Ji =0 

l c n 


has a solution in C. (See Exercise 9.16.) By choosing a c large enough, we 
can ensure that 


a n— 1 
C 




< 1 . 


Therefore, to establish the Fundamental Theorem of Algebra, we only need to 
consider equations of the form 


z n + b n -\z n 1 4- ... + b\z + bo = 0, 

with \b n -\| + ... + \bo\ < 1. These equations are the subject of the following 
theorem: 


THEOREM 9.15. If\b n -i\ + ... + |£>o| < 1, then the equation 
z n + b n -\z n ~ l + ... + biz + bo — 0 


has a solution in C. 
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Proof. We establish this result by contradiction. Assume that 

z n + b n -\z n ~ x + .. ■ + b\z + bo = 0 (9.2) 

has no solution within the complex plane. Let D be the disk and S ] be 
the circle, here considered as subsets of the complex plane. Since Equa¬ 
tion 9.2 has no solution, we can define a continuous function 
F : D -*■ C — {0} by 

F(z) = z n + b n -\z n ~ ] + ... + b\z + bo- 

The function F is an extension of the function / : S 1 -» C — {0}, where 
Hz) = F(z). 

We define new continuous functions g : S 1 —»• S 1 and G : D -> S 1 
by g(z) = and G(z) = The function G is an extension of 

g ; therefore Theorem 9.9 implies that g has degree 0. 

We claim that g is homotopic to c n (z) = z n . The function c n is the 
function that we previously expressed as c n (0) = n0 \here it is expressed 
as a function of the complex variable z, rather than as a function of the 
angle 0. Given the claim, we have a contradiction since g has degree 0 
and c n has degree n / 0. 

To prove the claim, define H : S l x [0, 1] -> S 1 by 

H( t) _ (z n +t(b n -]z n ~ 1 +... +friz + fro)) 

\z n + t(b n -]z n ~ l + ■ ■ ■ + b\z + &o)l’ 

To establish that H is well defined, we need to show that for all (z, t) e 
S l x [0, 1], 


\z n + t(b n ~iz n 1 + ... + b\z + £o)l > 0. (9.3) 

Since t e [0, 1], |&„-i| + ... + |(?ol < 1. and |z 7 | = 1 for all z e S* and 
j e Z+, we have 

|z "+ ...+b\z + bo)\ > |z n | ~t\b n _iz n ~ l +... + b\z + b 0 \ 

> 1 - t{\b n _\z n ~ x | + ... + \b\z\ + |A 0 |) 
= 1 -/(iV-il + ... + |f>ol) 

>1 - t 

> 0 . 

From the expression for H(z, t ) and Inequality 9.3, it follows that H is 
a well-defined continuous function mapping S 1 x [0,1] to S 1 . Further¬ 
more, H{z, 1) = jjyyyy = g(z) and H(z, 0) = i^j = z", with the latter 

equality holding because |z”| = 1 for all z e S l . Therefore, H is a 
homotopy between g and c n , and the proofs of the claim and the theorem 
are complete. B 
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Exercises for Section 9.4 

9.16. Show that the equation 

z" +a n - iz” _1 + ... +a\z + ao = 0 


has a solution in C if and only if, for each positive real number c, the equation 


z n + ■ 


-z"-‘ + ...+ 


a\ 

r n —1 


z + ^ = o 


has a solution in C. 

9.17. Provide justifications for each step in the following portion from the proof of 
Theorem 9.15: 


\z n +t(b n _iz n 1 + ...+b\z + b 0 )\ > \z n \-t\b n -\z n 1 + ...+b\z + bo\ 

> l-t(\b n - 1 z n ~ l \ + ... + \b l z\ + \bo\) 
= 1- t(\b n _i\ + ... + |*o|) 

>1 -t 

> 0 . 


9.18. (a) Prove that if c is such that 

I a n—\ I 
I C I 

then all of the solutions to 




< 1, 


z" +a„_iz" 1 + ... +aiz + ao — 0 


satisfy |z| < c. (Hint: Show that if |z| > c, then 


a n -\ + a\ 


ao 

z" _1 z" 


< 1, 


and, from that, prove that no such z can be a solution to the equation.) 

(b) Show that all solutions to the equation 

z" +z n_1 + ... + Z+ 1 =0 

lie in the open disk of radius 2 centered at the origin in the complex plane. 


9.5 More on Distinguishing Topological Spaces 

In this section we extend results from Section 6.2 on distinguishing between 
topological spaces. In Section 6.2, we used the concept of connectedness to 
prove that a variety of spaces are not homeomorphic to each other. Here, we 
use the concept of simple connectedness in a similar way. 

DEFINITION 9.16. A path connected topological space X is called simply 
connected ifevery continuous function f : S 1 —► X is homotopic to a constant 
function. 
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If we think of a continuous function / : S 1 -> X as a loop in X, a path 
connected topological space is simply connected if every loop in X can be 
deformed to a point in X. (See Figure 9.20.) 



FIGURE 9.20: The space X is simply connected if every loop in X can be deformed 
to a point. 


EXAMPLE 9.9. It is not difficult to see that the plane R 2 is simply connected. 
Every continuous function / : S 1 -»• JR 2 is homotopic to a constant function 
sending S 1 to the origin. Such a homotopy is accomplished by using the 
straight-line homotopy F : S l x / -*■ R 2 , defined by F(x, t) = (1 — t)f(x). 
On the other hand, if we remove the origin from the plane, then the resulting 
space M 2 — {O} is not simply connected. A loop wrapping once around the 
origin cannot be deformed to a point in the plane. (See Figure 9.21.) We prove 
this in Theorem 9.19. 



FIGURE 9.21: The plane M 2 is simply connected, but R 2 - [O] is not. 


The following two theorems concerning simple connectedness are straight¬ 
forward. We leave their proofs to the exercises. 

THEOREM 9.17. Simple connectedness is a topological property. That is, if 
X is homeomorphic to Y, then X is simply connected if and only ifY is simply 
connected. 


Proof. See Exercise 9.20. 
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THEOREM 9.18. IfX is simply connected and A is a retract of X, then A is 
simply connected. 

Proof. See Exercise 9.21. ■ 

In Section 6.2, when we showed that the line is not homeomorphic to the 
plane, we did so by showing that removing a point from the line results in a 
disconnected space, while removing a point from the plane does not. The same 
idea works for us here in showing that the plane is not homeomorphic to R 3 . 
Specifically, we prove that removing a point from the plane results in a space 
that is not simply connected, while removing a point from R 3 results in a space 
that is simply connected. These results should be intuitively clear. We work 
out the details next. 

THEOREM 9.19. The space R 2 — {O} is not simply connected. 

Proof. We show that there is a continuous function / : S 1 —> R 2 — {0} 
that is not homotopic to a constant function. To do this, let / be the 
embedding of S 1 onto itself as a subspace of R 2 — {O}. We claim that 
there is no homotopy from / to a constant function. 

Suppose that there is a homotopy F : S 1 x / —>• R 2 — {O} from 
/ to a constant function. Then, if we define G : S 1 x / -»■ S 1 by 
G(x,t) = | ^^’^ | , it follows that G is a homotopy from the identity 
function on the circle to a constant function on the circle. But the identity 
function on the circle has degree 1, and a constant function has degree 0. 
Therefore, by Theorem 9.7 no such homotopy G exists. Thus we have 
a contradiction to our assumption that F exists. It follows that / is not 
homotopic to a constant function, and M 2 — {O} is not simply connected. 


Now, let a be a point in R 3 . Assume that / : S 1 -> R 3 — {a} is continuous. 
It is apparent that there is enough room to deform / to a constant function while 
avoiding the missing point at a. (See Figure 9.22.) We show how to construct 
such a deformation in the proof of the next theorem. 



FIGURE 9.22: There is room in M 3 - {«} to deform / : S 1 -r R 3 - (a) to a constant 
function. 
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THEOREM 9.20. For each a € R 3 , the space R 3 — {a} is simply connected. 

Proof. We prove that if / : S 1 -»■ R 3 — {a} is continuous, then it 
is homotopic to a constant function. We do so by demonstrating a two- 
stage homotopy from / to a constant function. However, we only present 
a description of what takes place under the homotopy, rather than provide 
a specific definition of it. 

To begin, since f(S l ) is compact (and therefore closed) and since 
a g /(.S’ 1 ), there exists an e > 0 such that the open ball of radius e 
centered at a is disjoint from /(S 1 ). Using the compactness of /(S 1 ) 
again, we can cover /(S 1 ) by finitely many open balls of radius e/2. 
Each such open ball does not contain a. 

If we take the preimage under / of each of these open balls, the result 
is an open cover of 5*. By Corollary 7.28, we can subdivide S ] with 

0 = 9o < 6\ < ... < 6 m = 2n 

such that for each i = 0,... , m — 1, the interval [0,-, 0,+i] is contained in 
a set in the open cover of S ! . Therefore each set /([#,, 0/+i ]) is contained 

in one of the aforementioned open balls. For each i = 0. m — 1, 

pick such an open ball and denote it by B,. For each i = 0,... ,m, set 
Si = /(<?,•), and let 5,- be the line segment connecting s; and .v;+]. (Note 
that it is possible that the segment Si is a single point.) Each 5, lies in 
the open ball B,. 

In the first stage of the homotopy, we deform each /([0/,0;+ 1 ]) 
within B; to S,-. (See Figure 9.23.) This deformation can be done so that 
each point st remains fixed throughout the deformation. Since B; does 
not contain a, this deformation takes place in R 3 — {a}. Thus /(S 1 ) has 
been deformed in R 3 — {a} to a closed curve P that is made up entirely 
of line segments. 



FIGURE 9.23: Deforming /([0;,0,- + i]) to 5; within B,. 


For each i — 0,... , m — 1, let A,- be the plane through a, Si, and 
j ;+ i (or the line through a, s;, and s;+i if they are collinear). We can 
choose a point e R 3 such that / does not lie in any of the sets A,-. Note 
that a does not lie in any of the triangles 7} with vertices fi,si, and s I+ 1 . 
(By triangle, here, we include the possibility that s,- = 5,+i.) 
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Now, in the second stage of the homotopy, we deform each line 
segment 5, within 7} to the point p, and the deformations are done so 
that they all glue together, resulting in a continuous deformation of the 
closed curve P to the point f. (See Figure 9.24.) This second stage of 
the homotopy also takes place in R 3 — {a} since a does not lie in any of 
the sets 7). 



FIGURE 9.24: Deforming each S; to p. 

When we combine the two stages of the homotopy, the result is a 
homotopy from / to a constant function, and it follows that R 3 — {a} is 
simply connected for all a 6 R 3 . ■ 

Now, via Theorems 9.17,9.19, and 9.20 we can distinguish the plane from 
3-space. 

THEOREM 9.21. The plane R 2 is not homeomorphic to 3-space, R 3 . 

Proof. Suppose that there is a homeomorphism / : R 2 —» .R 3 . Since 
R 2 — {O} is not simply connected, it would follow that R 3 — {/(O)} is 
not simply connected as well, a contradiction. Thus no such / exists, 
and it follows that R 2 is not homeomorphic to R 3 . ■ 

The approach we used in proving that R 3 — {a} is simply connected can 
also be used to prove that R" — {a} is simply connected for all n > 3 and all 
a e R". Thus it follows that R 2 is not homeomorphic to R" for all n > 3. 
Furthermore, the function / : R” +1 — {O} —»• S n , given by f(x) = ■j^y, is a 
retraction. Therefore Theorem 9.18 implies that S n is simply connected for all 
n > 2. The circle, S 1 , is not simply connected. (See Exercise 9.22.) Hence, 
S 1 is not homeomorphic to S n for all n > 2. 

Exercises for Section 9.5 

9.19. Prove that a topological space X is simply connected if and only if every con¬ 
tinuous function f : S l —> X extends to a continuous function F : D -*■ X 
where D is the disk in the plane. 

9.20. Prove Theorem 9.17: If X is homeomorphic to Y, then X is simply connected 
if and only if Y is simply connected. 
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9.21. (a) Prove Theorem 9.18: If X is simply connected and A is a retract of X, 

then A is simply connected. 

(b) Provide an example showing that if A is a retract of X and A is simply 
connected, then X need not be simply connected. 

9.22. Prove that S' 1 is not simply connected. (Hint: Use an approach similar to that 
used in the proof of Theorem 9.19.) 

9.23. Show that X x Y is simply connected if and only if X and Y are simply con¬ 
nected. 


9.6 More on Degree 

In this section we prove Theorem 9.5, the main result establishing the defi¬ 
nition of degree and the invariance of degree under homotopy between circle 
functions. In our view of S 1 in Sections 9.2 and 9.3, we used real values 9 to 
represent points on the circle, understanding that 9 is associated to the point 
with counterclockwise angle 6 from the positive x-axis. In this section, we 
need to reserve real values for the purpose of representing points on the real 
line. In order to avoid confusing the two interpretations, we represent points on 
the circle in their complex exponential format. So e ,d represents the complex 
number cos(0) + sin(0)i on the circle, located at angle 9 from the x-axis. (See 
Figure 9.25.) 



FIGURE 9.25: Representing points on the circle in complex exponential format. 

We show that for every circle function / : S 1 —> S 1 there is a unique 
n € Z such that / is homo topic to the function c n : S l -> S 1 , defined 
by c n (e ld ) = e‘ nd . This c„ is the function obtained by taking the function 
f(z ) = z n , defined on the complex plane, and restricting it to the circle. 
This function also corresponds exactly to the function c n : S 1 -> S l that we 
previously presented as c n (9) = n6. 

In order to prove the desired result, we associate to each circle function 
/ : S 1 -+ S 1 a function /* : [0, 2tt] —»• R that essentially unwraps /. We 
then show that a straight-line homotopy exists between /* and a linear function 
c* : [0, 27t] —»■ R, defined bye* (9) = n9. From that homotopy we then obtain 
the desired homotopy between circle functions. 

Every circle function / : S 1 ->• S 1 can be viewed as a continuous function 
/ : [0, 27r] ->■ S 1 having /(0) = /( 2n). Conversely, given a continuous 
function / : [0, 2n] -> S 1 satisfying /(0) = /(2rr), there is a circle function 
/ : S 1 -> S 1 naturally associated with it. Thus, in this section we work with 
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continuous functions / : [0, 2jt] —>■ S 1 satisfying /(0) = /( 2n). As with 
continuous functions / : 5* —*■ S 1 , we also refer to these functions as circle 
functions. 

Homotopies between circle functions play an important role in the results 
we present here. Such a homotopy, H : [0, 2n] x [0, 1] -> S 1 , must define a 
circle function for each t £ [0, 1]. Thus, we have the following definition: 


DEFINITION 9.22. Circle functions /, g : [0, 2 n] —► S 1 are said to be circle 
homotopic if there is a homotopy H : [0, 2n] x [0, 1] -»■ S 1 between f and g 
such that H( 0, t) = H(2n, t)for all t £ [0, 1]. Such a function H is called a 
circle homotopy. 

A circle homotopy H : [0, 2n] x [0,1] —»• S 1 naturally defines a ho¬ 
motopy H : S 1 x [0, 1] -> S 1 between circle functions that have domain 
S 1 . Furthermore, a homotopy H : S 1 x [0, 1] ^ .S 1 naturally yields a circle 
homotopy. 

Now consider the function p : R -v S 1 defined by p{6) = e ,d . The 
function p maps each interval [r, r + 2n) bijectively around the circle and 
therefore wraps the real line around the circle an infinite number of times. 
(See Figure 9.26.) It is an example of a type of continuous function known as 
a covering map. 



FIGURE 9.26: The function p wraps [r, r + 2n ) around S 1 . 

When we restrict p to an open interval (a, b ) with b — a < 2n, the 
resulting function p\( a ,b) is a homeomorphism onto its image in S 1 . For each 
r £ R, let q r be the inverse of the homeomorphism pl( r , r +2x)- The function 
q r homeomorphically maps S 1 — {e ir }, the circle with the point e ir removed, 
to the interval {r,r + 2n), as illustrated in Figure 9.27. 

The idea behind unwrapping a circle function / : [0, 2 tt] -> S 1 is to 
associate to it a function /* : [0, 2 n] -*■ K, as follows: 


DEFINITION 9.23. Let f : X -*■ S 1 be continuous. A continuous function 
f * : X —>■ IS. is called a lifting of f if P ° f* = f- (See Figure 9.28.) 
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FIGURE 9.27: The functions q r and p\( r ,r+ 2 K) are inverse homeomorphisms. 



FIGURE 9.28: A lifting f* of / satisfies p o /* = /. 


EXAMPLE 9.10. A path wrapping three-quarters of the way around the circle 
counterclockwise from 1 = e‘° has a lifting that is a path in M running from 
0 to 3tt/2. A path wrapping exactly once around the circle clockwise from 
— 1 = e ,7t has a lifting that is a path in R running from n to — n. (See 
Figure 9.29.) 



FIGURE 9.29: Liftings of paths on the circle. 


j 


The following theorem asserts that liftings exist for a function / : X -*■ S ] , 
as long as / is not suijective: 
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THEOREM 9.24. Let f : X -> S 1 be continuous and assume that f(X) is a 
proper subset of S 1 . Furthermore, assume that xq £ X andro e p~ l (f(x o)). 
Then there exists a lifting, /* : X —> M, o// such that f*(x o) = ro. 

Proof There exists s £ S 1 such that s & f (X). We can choose 
r £ p~ l (s) such that ro £ (r, r + lit). Let q r be the homeomorphism 
previously described, and define /* : X —>■ R by f*(x) = q r (f (x)). It 
follows that f* is a lifting of /, and f*(x o) = ro- (See Figure 9.30.) 


r r + 2tr 



FIGURE 9.30: Lifting a function / : X -> S 1 that is not surjective. ® 

Liftings are not uniquely determined. Since p(0) = p{9 + 2nri) for every 
0 e B and n e Z, it follows that if f* is a lifting of / : X S\ then for 
every n £ Z the function f*:X—> R, defined by /„*(*) = /*(x) + 27rn, is 
also a lifting of /. In the case where X is connected, this accounts for all of 
the possible liftings, as the following theorem indicates: 

THEOREM 9.25. Let f : X —»• S 1 be continuous and X be connected. If 
g, h : X -> R are liftings of f, then there exists n £ Z such that g(x) — h{x) = 
2 nnfor all x £ X. 

Proof. Let* £ X be arbitrary. Then p(g(x )) = f(x) = p(h(x)), and 
therefore there exists n x £ Z such that g(x) — h(x) = 2nn x . Define a 
function k : X -> R by &(x) = g(x) — h(x). The image of k is the set 
{2 nn x \x £ X}, and this set must be connected since k is continuous and 
X is connected. Thus, the set [2nn x | x £ X] is a connected subspace of 
R, implying that n x = n y for each x,y € X. Hence, there exists n £ Z 
such that g(x) — h(x) = 2nn for all x £ X. ■ 

The next theorem is a straightforward consequence of the previous one. It 
indicates that if X is connected and two liftings agree at one point in X, then 
they agree at every point in X. 

THEOREM 9.26. Let f : X —> S 1 be continuous, X be connected, and 
g,h : X —> R be liftings of f. If there exists xo £ X such that g(xo) = h(x o), 
then g(x) = h(x)for all x € X. 


Proof. See Exercise 9.25. 
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Now, we are ready to unwrap circle maps. The idea is to show that for 
every circle map / : [0, 2n] —r S l , there exists a lifting f* : [0, In] -> R. 
We establish a slightly more general lifting result that holds for all continuous 
functions / : [0, 2n] —*■ S 1 , not just for circle functions. 

THEOREM 9.27. Let f : [0, 2n] —> S 1 be continuous. Then there exists a 
function f* : [0, 2n] —R that is a lifting of f. 

Proof. Recall that the complex numbers e‘° and e in equal 1 and — 1, 
respectively. Let U = S l — {1} and V = S 1 — {—1}. The open sets U 
and V together cover S 1 , and therefore {/“'(I/), /“' (V)} is an open 
covering of [0, 2 n]. Corollary 7.28 implies that there is a subdivision 
0 = do < 0] < ... < 9 n = 2n of [0, 2 tt] such that for all j = 1,... ,n, 
the interval [9j~i, 9j] is contained in either f~ l (U) or f~ l (V). For 
each interval [9j-\,9j] the restriction f\[e j _ l ,e J ] is not surjective, and 
therefore Theorem 9.24 implies that there is a lifting of /|[e,_ 1 ,e;]- The 
goal is to build these liftings up, step by step, so that they properly join 
together to form the desired lifting /*. 

To begin, let g\ : [0o, 0i] -* R be a lifting of f\[o th e,\- Set /*, 
the first part of our lifting, equal to gy. Note that p{g\{9\)) — f{9\), 
and therefore gi (0i) € p~ { if {&{))■ Thus, by Theorem 9.24 there exists 
82 ■ [6*1,02] -> R, a lifting of f\]e u e 2 \ with g 2 (9y) = gi(0i). By the 
Pasting Lemma we can paste the functions gi and g 2 together to obtain 
a continuous function ff : [0o, 02] -> R. The function ff is a lifting of 
/|[0o,e 2 ]- We can continue this construction, and the function /„* obtained 
at the nth step is the desired function /*. ■ 

If / : [0, 2 tt] -> 5 1 is a circle map, then by definition /(0) = f(2n). 
However, it need not be that /*(0) = f*(2n) for a lifting /* : [0, 27 t] -h>- R 
of /. In fact, the difference between f*(2n) and /*(0) is related to the 
degree of /; specifically, f*(2n) — /*(0) = 2 n deg(/), as we will show in 
Theorem 9.31. 

Now we are prepared to prove the existence claim of Theorem 9.5. Let 
c n : [0, 27t] -4- S 1 be the circle function defined by c„(0) = e ind . 

THEOREM 9.28. Assume that f : [0, 2n] ->■ 5 1 is a circle function. There 
exists n e Z such that f is circle homotopic to c n . 

Proof. Let f* : [0, 27r] -» R be a lifting of /. Since /(0) = f(2n), 
it follows that p(f*( 0)) = p(f*(2n)), and therefore there exists n e Z 
such that f*(2n) — /*(0) = 2nn. 

Consider the straight-line homotopy H : [0, 2n] x [0,1] R, 
given by H(9, t) = (1 — t)f*(9) + tn9. The function H is a homotopy 
between f* and the function c* : [0, 2n] —> R, defined by c*(9) = n9. 
Define G : [0,2 n] x [0,1] —S 1 by G(x, t ) = p(H(x, t )). We claim 
that G is a circle homotopy between / and c n . 
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To begin, we need to show that G(0, t) = G(2it, t) for all t e [0,1]. 
Thus let t e [0,1] be fixed. Note that H( 0, t) = (1 — t)/*(0) and 
H(2tv, t) = (1 — t)f *(2n) 4- tn2jt = (1 - t){2nn + /*(0)) + t2;rn = 
(1 - t)/*(0) + 2 nn = H( 0, t) + 27tn. 

Thus H( 0, t) and H(2tt, t) differ by 2nn, and it follows that 
p(H(0,t )) = p(H(2jt,t)). Therefore G(0, t) = G(2n, t), implying 
that G is a circle homotopy. 

Now, G(0,0) = p(f*{6)) = f(0) and G(0, 1) = P (n0) = e ind . 
Thus G is a circle homotopy between / and c„. ■ 

The last step in establishing Theorem 9.5 is to prove the uniqueness result. 
That is, we need to show that if / is circle homotopic to c„ and c m , then 
n = m. It suffices to prove that if c n is circle homotopic to c m , then n = m. 
To accomplish this, we need another lifting result, this one for homotopies. 

THEOREM 9.29. Let H : [0, 2n] x [0,1] -» S 1 be continuous. Then there 
exists a function H* : [0,2 n] x [0, 1] —> R that is a lifting of H. 

Proof. The approach here is similar to the one used in proof of The¬ 
orem 9.27. Again, let U = S 1 — {1} and V = S 1 — {—1}. Then 
{H~ l (U), //“'(V)} is an open cover of [0,2?r] x [0, 1]. As a conse¬ 
quence of /, the Lebesgue Number Lemma, we can subdivide 
[0, 2 tt] x [0, 1] with 

0 = 0o < $1 < ... < 0 n =2n and 
0 = to<ti < ... <t m - 1 

such that each rectangle [#,_i,0,] x [tj-\,tj] is contained in either 
or H~\V). By Theorem 9.24, we can obtain liftings of H 
restricted to each of these rectangles. The task is to properly choose 
these liftings and paste them together to obtain the desired lifting H*. 

To simplify the notation, let j be the rectangle [0,-_i, 0;] x [tj~\, tj]. 
Begin by defining Gi,i : i?i,i —► R to be a lifting of //|«, r We also 
denote G 1,1 by//*,. Note that Gi,i(0i, to) e P~ l (H(9\,to)). Next, 
let G 2 ,i : /? 2,1 R be a lifting of H\r 21 satisfying G2,i(0i,*o) = 
G\,\(6\, to). At this point we have liftings of H to each of the rectan¬ 
gles /?i,i and /? 2 ,i illustrated in Figure 9.31. These liftings agree at the 
point (0i, to), but we need to have them agree over the entire intersection, 
/f 1,1 fl /? 2 ,i, in order to be able to paste them together to have the lifting 
defined over /?i,i U /? 2 ,i- 

We claim that, in fact, they do agree over /?i, 1 D /? 2 , 1 . The intersec¬ 
tion is the set S = {6\} x [to, f 1 ]. The restrictions G 1 , 1 1s and G 2 ,i Is are 
both liftings of //|$, and they agree at the point (0 \, to). Theorem 9.26 
implies that Gi,i|s = G 2 ,i|s, and therefore Gi,i and G 2 ,i agree over 
R\ 1 n /?2,i- Thus we paste G],i and G 2 ,i together to obtain a lifting 
//* 1 of//i / f 1-lU /t 2 , 1 . 
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FIGURE 9.31: The homotopy H is lifted one rectangle at a time. 

We continue this construction, building H* across the bottom row of 
rectangles in the partition of [0, 2n] x [0, 1], then across the next row, 
and so on. The function H* m is the desired lifting of H. U 


Next we have the uniqueness result establishing that degree is well defined. 


THEOREM 9.30. Ifc n is circle homotopic to c m , then n = m. 

Proof. Let G : [0, 2n ] x [0, 1] —S 1 be a circle homotopy with 
G(8, 0) = c n (9) and G(6, 1) = c m {9). By Theorem 9.29 there exists a 
function G* : [0, 2;r] x [0, 1] -* IR that is a lifting of G. 

Since G(8, 0) = c„{6), it follows that G*|[o,2tt]x{0) is a lifting of 
c n . The function c* : [0,27r] -> M, defined by c*(9) = n6, is also a 
lifting of c„. Theorem 9.26 implies that there exists d e Z such that 
G*(8, 0) = n9 + 2nd for all 9 € [0, 27t]. Therefore 

G*(2n, 0) - G*(0, 0) = 2nn + 2nd - 2nd = 2 nn. (9.4) 

Similarly, we obtain 


G*(2n, 1) - G*(0, 1) = 2 nm. (9.5) 

Now, G being a circle homotopy implies that the functions G | {oj x [ 0 , l] 
and G|{ 2 ^}x[ 0 ,i]. considered as functions mapping [0, 1] to S 1 , are identi¬ 
cal. Therefore G* \ {oj x [0, l] and G* \ \ 2 n ) x [0, l] are liftings of the same func¬ 
tion. Theorem 9.25 implies that there exists be Z such that G* (27r, t) = 
G*(0, t) + 2nb for allt e [0,1], It follows that G* (2n, 1)-G*(0, 1) = 
G*(2n,0) — G*(0,0), which, by Equations 9.4 and 9.5, implies that 
2nm — 2nn, and therefore m = n. ■ 


Through Theorems 9.28 and 9.30 we have proven Theorem 9.5, establish¬ 
ing that degree is a well-defined property of circle functions. By a proof similar 
to the proof of Theorem 9.30, we can establish the following theorem relating 
degree to the endpoints of liftings of circle functions: 
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THEOREM 9.31. Let f : [0, 2ti] —> S l be a circle function, and let 
f* : [0, 2n] —»• M be a lifting of f. Then f*(2n) — /*( 0) = 2n deg(f). 

Proof. See Exercise 9.26. ■ 

Exercises for Section 9.6 

9.24. Show that if we drop the hypothesis that X is connected in Theorem 9.25, then 
the result need not hold. That is, find a continuous function / : X —> S\ with 
disconnected domain X, such that / has liftings g, h : X -*■ R that do not 
differ by a constant. 

9.25. Prove Theorem 9.26: Let / : X —*■ S l be continuous, X be connected, and 
g, h : X —► R be liftings of /. If there exists xq € X such that g(xo) = h(x o), 
then g(x) = h(x) for all x e X. 

9.26. Prove Theorem 9.31: Let / : [0,27r] —*■ S 1 be a circle function, and let 
/* : [0, 2n] -+ R be a lifting of /. Then f*(2n) - /*(0) = 2 n deg(/). 
(Hint: Let G be a circle homotopy between / and c„ where n = deg(/), 
G(6, 0) = / (0), and G(0, 1) = c„{0). Let G* be a lifting of G. Argue as in 
the proof of Theorem 9.30 and show that G*(27r, 0) — G'*(0,0) = Inn. Then 
use Theorem 9.25 to obtain the desired result.) 

9.27. Prove that if / : S l S 1 is a homeomorphism, then deg(/) = ±1. 


Supplementary Exercises: The Borsuk-Ulam Theorem and 
the Ham Sandwich Theorem 

We introduced the Borsuk-Ulam Theorem in Section 6.3, where we proved that if 
/ : S 2 -*■ K is continuous, then there exists x e S 2 such that fix) = /(—x) (viewing 
x e S 2 as a unit vector based at the origin in R 3 ). We indicated then that we could do 
better, specifically that the same re.sult holds with range R 2 rather than R. As mentioned 
in Section 6.3, this implies that there must be a point on the surface of the Earth that 
has exactly the same temperature and barometric pressure as its antipodal point. We 
are now in a position to prove this stronger result; we do so through the following 
exercises. 

A circle function / : S 1 —► S 1 is said to be antipode-preserving if / maps antipodal 
points to antipodal points. That is, if we view / as complex-valued, then for all 
complex z in S 1 we have /(— z) = -f(z). (See Figure 9.32.) We can also express 




FIGURE 9.32: Antipode-preserving functions map antipodal points to antipodal 
points. 
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this relationship as /(«' (e+?r) ) = e llT f(e' 6 ). When the circle function is represented 
as a map / : [0, 2n] -*■ S 1 such that /(0) = /( 2n), then to be antipode-preserving / 
must satisfy f(8 + n) = — f(8 ) = e ,7r f(9) for all 8 e [0, n\. 

This requirement yields /( 2n) = —/(n-) = — (—/(0)) = /(0). Of course, f(2n) = 
/(0) is required of circle functions by their definition, and therefore the requirements 
for being a circle function and for being antipode-preserving are consistent with each 
other. 

Assume that / : [0, 27 t] -*■ S 1 is an antipode-preserving circle function, and let 
/* : [0, 2;r] —> R be a lifting of /. 

SE 9.28. Prove that there exists ns Z such that /*(0 + n) = f*(8) + (2 n + l)n for 
all 8 s [0, jr]. 

SE9.29. Prove that f*(2n) — /*(0) ^ 0, and conclude that antipode-preserving 
circle functions have nonzero degree. 

Now we want to expand the concept of antipode-preserving to include functions 
/ : 5" —*■ S m . In order to do so most conveniently, we represent points in S n us¬ 
ing unit vectors x based at the origin in ® n+1 . In this case, the requirement for a 
function / : S n -*■ S m to be antipode-preserving is that /(—x) = — f(x) for all 
^ s S n . 


THEOREM 9.32. There is no continuous antipode-preserving function f : S 2 -> S 1 . 


SE 9.30. Prove Theorem 9.32. (Hints: Assume that there is an antipode-preserving 
function / : S 2 -» 5* and let /o be the circle function obtained by restricting 
/ to the equator. Show that you can apply Exercise SE 9.29 to fo . Then restrict 
/ to the upper half of S 2 , including the equator, and argue that you can apply 
Theorem 9.9 to fo. You should be able to derive a contradiction resulting from 
these two views of fo.) 


THEOREM 9.33. The Borsuk-Ulam Theorem. Let f : S 2 -*■ R 2 be continu¬ 
ous. There exists x € S 2 such that f(x) = f (—x). 

SE 9.31. Prove Theorem 9.33. (Hint: Assume no such x exists and consider the 
function defined by h(x) = •) 

Along with the simple meteorological application already mentioned, another interest¬ 
ing consequence of the Borsuk-Ulam Theorem is the Ham Sandwich Theorem. This 
states that if you have three solids in M 3 , then there is a plane in M 3 that simultaneously 
cuts each solid in half. (See Figure 9.33.) Thus if you make a sandwich from a piece of 
bread, a slice of ham, and another piece of bread, then no matter how you arrange the 
bread and ham, you can cut through the sandwich with a single, straight knife cut and 
divide each piece of bread and the ham exactly in half. The Ham Sandwich Theorem 
plays a role in fair-division problems in applications of mathematics in the social sci¬ 
ences. These problems address, for example, the fair distribution of resources among 
multiple parties vying for them. (See [Hil], for example.) 
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FIGURE 9.33: There is a plane that cuts each of the three solids in half. 

We now discuss an approach to proving the Ham Sandwich Theorem. Let cri, < 72 , and 
<73 be three solids in R 3 . By “solid” we mean a bounded set with a well-defined volume, 
a notion from measure theory whose details we leave unspecified. We define a function 
d : S 2 -> R 2 that is used to prove the existence of a plane bisecting each of the solids. 
Each point x in S 2 is treated as a unit vector in R 3 , describing a direction in R 3 . Fix 
x e S 2 . Consider all of the planes that have x as a normal vector. 

SE 9.32. Discuss why there must be a plane Pi(x), with normal vector x, that cuts 
solid eri in half. (Hint: For each plane P with normal vector x, consider the 
quantity Vp that indicates the percent of the volume of a\ that lies on the side 
of P in the direction of the normal vector x.) 

Similarly, there must be planes P^(x) and P^(x) with normal vector x that cut solids 
<72 and <73 in half, respectively. We show that there must be an x* e S 2 such that 
P\(x*) = Pi(x*) = Pt,(x*), and therefore this common plane simultaneously cuts 
each solid in half. 

Let d\ (x) be the distance from plane P^{x) to plane Pi (x) if plane Pi (x) lies on the 
side of P-)(x) in the direction of the normal vector x, and let d\ (x) be the negative of 
the distance from plane P}(x) to plane Pi (x) if plane P\(x) lies on the side of Pj(x) 
opposite the direction of the normal vector*. Define di{x) similarly in terms of the 
distance from P^(x) to Pi(x). We assume that the function d : S 2 -*■ R 2 defined by 
d(x) = (d i (*), <^ 2 (*)) is continuous. 

SE 9.33. Prove that d(—x) = —d(x) for each * e S 2 . 

SE 9.34. Use the Borsuk-Ulam Theorem, in conjunction with Exercise SE 9.33, to 
prove that there exists x* e S 2 such that d(x*) = (0,0). Show that the Ham 
Sandwich Theorem follows. 


Chapter 10 

Fixed Point Theorems 
and Applications 


Let / : X -> X be a function mapping a topological space X to itself. If 
f(x) — x for some x in X, then x maps to itself under / and is known as a 
fixed point of /. Fixed points and theorems about them play an important role 
in many areas of pure and applied mathematics. Within topology itself there is 
a large body of work on fixed point results. One of the most well known is the 
Brouwer Fixed Point Theorem. It states that every continuous function from 
an n-ball B n to itself must have a fixed point. We proved the One-Dimensional 
Brouwer Fixed Point Theorem as a consequence of the Intermediate Value The¬ 
orem in Section 6.3. Here, in Section 10.1, we prove the Two-Dimensional 
Brouwer Fixed Point Theorem as a consequence of the Two-Dimensional No 
Retraction Theorem. In Section 10.2, we present an application of the Brouwer 
Fixed Point Theorem to prove the existence of equilibrium price distributions 
in a pure exchange economy. In Section 10.3, we present a generalization of 
the Brouwer Fixed Point Theorem, known as Kakutani’s Fixed Point Theo¬ 
rem, which applies to set-valued functions. Finally, in Section 10.4, we apply 
Kakutani’s Fixed Point Theorem to prove the existence of Nash equilibria in 
game theory. This is one of the most important results in the field of game 
theory, as it demonstrates that there is a choice of strategies that optimizes the 
expected outcome for all players of a game. 

10.1 The Brouwer Fixed Point Theorem 

Imagine taking two pieces of the same-sized paper and laying one piece on 
top of the other. Every point on the top sheet of paper is associated with some 
point right below it on the other sheet. Now take the top sheet of paper and 
crumple it up into a ball without ripping it. Place the crumpled ball back on 
top of the bottom sheet of paper. Somewhere on the crumpled ball of paper 
there is a point that is sitting directly above the same point on the bottom sheet 
of paper that it sat above before the crumpling took place. (See Figure 10.1.) 
This is an application of the Two-Dimensional Brouwer Fixed Point Theorem, 
which we prove in this section. 

DEFINITION 10.1. Let f : X —* X. A point x e X is said to be a fixed 
point of f if f (x) = x. A topological space X is said to have the fixed point 
property if every continuous function f : X —>■ X has a fixed point. 


EXAMPLE 10.1. The space [—1,1] has the fixed point property by the One- 
Dimensional Brouwer Fixed Point Theorem. 
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FIGURE 10.1: After the paper crumpling, some point still sits above the same point 
it sat above prior to crumpling. 


EXAMPLE 10.2. The real line R does not have the fixed point property since 
there exist continuous functions from R to R that do not fix any point. The 
function / : R -> R given by f(x) = x + 1 is an example. 

_i 

-1 

EXAMPLE 10.3. The circle S 1 does not have the fixed point property since 
the continuous function / : S 1 -»■ S 1 , given by rotating each point on the 
circle through an angle of n, does not have any fixed points. 


The fixed point property is a topological property, meaning that if spaces X 
and F are homeomorphic, then X has die fixed point property if and only if Y 
does. (See Exercise 10.3.) Therefore all closed bounded intervals [a, b ] with 
the standard topology have the fixed point property since [—1,1] does. Also, 
all open intervals (—oo, b), ( a , b ), and ( a , oo) with the standard topology do 
not have the fixed point property since R does not. 

One of the most useful theorems in topology is the Brouwer Fixed Point 
Theorem. It says that for each n, then-ball B n has the fixed point property. This 
theorem is named after the Dutch mathematician L. E. J. Brouwer (1881-1966) 
who proved the general n-dimensional version of the theorem in 1912. We show 
here that the Two-Dimensional Brouwer Fixed Point Theorem is equivalent to 
the Two-Dimensional No Retraction Theorem (with a proof that actually carries 
over to establish this equivalence for all n). Since we established the Two- 
Dimensional No Retraction Theorem in Section 9.2, the Two-Dimensional 
Brouwer Fixed Point Theorem follows. 

In a set of supplementary exercises in this section, we present a second proof 
of the Two-Dimensional Brouwer Fixed Point Theorem, one that involves the 
degree of circle functions, but does not rely on the No Retraction Theorem. 

THEOREM 10.2. The Two-Dimensional Brouwer Fixed Point Theo¬ 
rem. Every continuous function f : D -> D, mapping the disk to itself, has 
a fixed point. 

As already indicated, our approach to proving this theorem is to show its 
equivalence to the Two-Dimensional No Retraction Theorem. That is done via 
the following theorem: 
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THEOREM 10.3. The disk D, as a subspace ofM 2 , has the fixed point prop¬ 
erty if and only if there is no retraction from D onto its boundary S l . 

Proof. To begin, assume that there is a retraction r : D -» S 1 . Consider 
the map q : S 1 —> D, defined by q(x) = —x, where we consider x as a 
vector in the plane. The function q o r : D -*■ D is continuous and has 
no fixed point. Therefore, if there is a retraction r : D -+ S 1 , then D 
does not have the fixed point property. 

On the other hand, assume that a continuous function / : D -*■ D 
has no fixed point. We show that there is a retraction r : D -» S 1 . 
Define r : D -> S 1 as follows. First, take the ray in R running from 
f(x) through x. Such a ray is well defined since / has no fixed point. Let 
r(x) be the point where the ray intersects S 1 , as illustrated in Figure 10.2. 
Clearly, r maps D to S 1 and r(x) = x for all x € S 1 . It will follow that 
r is a retraction once the continuity of r is established. 



FIGURE 10.2: Defining r from the disk to the bounding circle. 

To prove that r is continuous, let U be open in S 1 and x be a point 
in r~ l (U). We show that there is an open set V containing x such that 
r(V) C U, and therefore r~ ] (U) is open. To begin, we choose small 
open balls 0\ and O 2 centered at f(x) and x, respectively, such that 
every ray beginning in 0\ and passing through O 2 intersects S 1 in the 
set U. (See Figure 10.3.) Since / is continuous, we can find an open set 
V, containing x and contained in O 2 , such that f(V) C 0\. Thus, for 
all v e V, the ray beginning at f(v) and passing through v intersects S 1 



FIGURE 10.3: There exists an open set V containing x such that f(V) c U. 
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in U. Therefore r(v) e U, and it follows that r is continuous. Hence 
we have established that if D does not have the fixed point property, 
then there is a retraction r : D S 1 , and the proof of the theorem is 
complete. ■ 

The Two-Dimensional No Retraction Theorem and Theorem 10.3 im¬ 
mediately imply Theorem 10.2, and therefore we have established the Two- 
Dimensional Brouwer Fixed Point Theorem. 

The approach to the proof of Theorem 10.3 carries through if we replace 
the disk D and the circle S 1 with the n-ball B n and the (n — l)-sphere 5" _1 , re¬ 
spectively, and in this way we can show that the n -Dimensional Brouwer Fixed 
Point Theorem is equivalent to the n -Dimensional No Retraction Theorem. 


EXAMPLE 10.4. Suppose we take a map of New England and place it on 
the ground anywhere within New England, as in Figure 10.4. We assume that 
New England is topologically equivalent to a disk, and we refer to it as N. Let 
/ be the function assigning to each point in New England the point on the map 
corresponding to it. We can view / as a continuous function from N to itself. 
Therefore / must have a fixed point, from which it follows that there must be 
a point on the map that corresponds exactly to the point on the ground directly 
beneath it. If, as a guide to travelers, we leave the map where it is, then on the 
map we would indicate the location of the fixed point with an arrow labeled 
“You Are Here!” 



FIGURE 10.4: The function from New England to the map must have a fixed point. 
_| 


Exercises for Section 10.1 

10.1. Show that each of the following spaces does not have the fixed point property: 

(a) The interval (0,1] 

(b) The torus 

(c) The figure-eight space obtained by taking two circles and gluing them 
together at a point on each 

(d) The sphere 
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10.2. Prove that if a topological space X has the fixed point property, then X is 
connected. 

10.3. Show that if X has the fixed point property and Y is homeomorphic to X, then 
Y has the fixed point property. 

10.4. Show that if X has the fixed point property and A is a retract of X, then A has 
the fixed point property. 

10.5. Show that if X does not have the fixed point property, then for all Y, the product 
space X x Y does not have the fixed point property. 

10.6. Prove the One-Dimensional Brouwer Fixed Point Theorem from the Two- 
Dimensional Brouwer Fixed Point Theorem. 

10.7. Consider the topological graphs of the letters A-E as illustrated in Figure 10.5. 
Determine which of these spaces has the fixed point property. (Hint: For the 
letter E, you may want to use Exercise 10.4.) 



FIGURE 10.5: 



Which of these letters has the fixed point property? 


Supplementary Exercises: Another Approach to the Two-Dimensional 
Brouwer Fixed Point Theorem 

In these exercises, we develop another proof of the Two-Dimensional Brouwer Fixed 
Point Theorem. In this case the proof is based directly on the degree of circle functions 
and does not rely on the No Retraction Theorem. In what follows, we assume that 
points x in S', D, and so on, are vectors based at the origin in the plane. We begin 
with a lemma. 

LEMMA 10.4. Let g : S 1 -> R 2 — {O} be continuous, and assume that the function 
h : S l —>■ S 1 , defined by h(x) = j ||^| , has degree 1. Then there exists leS 1 such 
that g(x) = ax for some a > 0. 

The lemma indicates that some x e S 1 (again, considered as a vector based at the 
origin) maps to a positive scalar multiple of itself under g. 

Proof. Define G : S l x [0, 1] —*■ R 2 by G(x, t) = (1 — t)g(x) — tx. The 
function G defines a straight-line homotopy from g (considered to have range 
R 2 ) to the function a(x) = —x, which maps each x e S 1 to its antipodal point 
—x e S 1 . 

SE 10.8. Show that there exists (x, r) € 5 1 x (0,1) such that G(x, t) = 0. (Hint: 
Assume that no such point (x, t) exists, and consider the homotopy of circle 
functions H(x, t ) : S 1 x [0, 1] -» S 1 , defined by H (x, t ) = ' y)j .) 
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SE 10.9. Show that if there exists (x,t) e S l x (0, 1) such that G(x,t) = 0, then 
g(x) = ax for some a > 0, thereby proving the lemma. g 

Circle functions do not necessarily have fixed points. A simple rotation of a circle by 
it leaves no point fixed. This rotation function has degree 1. Interestingly, that is the 
only degree for which a circle function can have no fixed point. For circle functions 
whose degree is not equal to 1, we have the following theorem: 

THEOREM 10.5. Let f : S 1 -*■ S 1 be a circle function with deg(f) 1. Then there 
exist x', x" € S 1 such that fix') = x' and fix") = —x". 

Thus if the degree of a circle function is not equal to 1, then the function has a fixed 
point and a point that maps to its antipodal point. 

SE 10.10. Prove Theorem 10.5. (Hint: Regard / : S 1 -* S 1 as a function mapping 
into R 2 — {O}, and use Lemma 10.4 to prove the existence of the fixed point. 
Then consider the function h : S 1 -> S 1 defined by h(x) = f(—x) and apply 
the first assertion of the theorem to h.) 

Now, we proceed with the Two-Dimensional Brouwer Fixed Point Theorem. Let D be 
the disk in the plane. 

THEOREM 10.6. Let f : D —> D be continuous. Then there exists x e D such that 
f(x) = x. 

Proof. Assume that this is not the case. Let k \ D M 2 - { O] be the function 
defined by k(x) = /( x) - x. The function k maps to R 2 - {O} since we are 
assuming that / has no fixed point. 

SE 10.11. Indicate why Lemma 10.4 applies to the function k\ s i : S 1 -*■ R 2 — {0}. 
(Hint: Show that the degree of the circle function involved is 0.) 

Therefore there exists x* e S 1 such that k(x*) = ax* for some a > 0. 

SE 10.12. Show that f(x*) € D, thereby arriving at a contradiction and completing 
the proof of the theorem. g 


10.2 An Application to Economics 

In this section, we show how fixed point theory can demonstrate the existence 
of economic equilibria. Suppose we have an economy consisting of a finite 
number of individuals who have certain items to buy and sell. We assume 
that there is neither consumption of the items nor new production of them. 
Therefore individuals who wish to have more of a particular item must sell 
some of their current stock of other items in order to purchase the item they 
desire. This is called a pure exchange economy. 

The price of the items is determined by supply and demand, what in 1776 
philosopher and economist Adam Smith (1723-1790) called the “invisible 
hand.” If there is more demand for a product than there is supply, the product’s 
price goes up. If there is excess supply and little demand, the product’s price 
goes down. We would like to know whether there is a choice of prices that 
balances supply and demand, an equilibrium point where prices stabilize and 
where all consumers are happy with their particular bundle of goods. 
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In 1932, John Von Neumann (1903-1957) gave a seminar at Princeton 
entitled “On a System of Economic Equations and a Generalization of the 
Brouwer Fixed Point Theorem.” In it, he outlined how fixed point theory 
could be utilized to prove the existence of equilibria in economic models. 
Generalizations and applications of this concept have resulted in Nobel Prizes 
in economics for Kenneth Arrow in 1972 and Gerard Debreu (1921-2004) in 
1983. Applications to game theory, as discussed in Section 10.4, also led to a 
Nobel Prize in economics for the mathematician John Nash in 1994. 

Let us look at a particularly simple example as a starting point. Suppose we 
have an economy with only three items available. These are cashmere, butter, 
and gunpowder. The total quantity of each, in pounds, is called the supply, and 
the three supplies are denoted Sc, Sb, and Sg, respectively. We keep track of 
the supply of each item in a vector called the supply vector S = (Sc, Sb, Sg)- 

There are n individuals trading in this economy. We denote them by the 
integers 1,2,..., n. Each individual starts with some number of each item, 
collectively called his or her bundle of goods. Individual i has a bundle b = 
(b‘ c , b l B ,b l c ), a. three-dimensional vector giving the quantity of each item in 
the corresponding component. At any given time, the sum of all of the bundle 
vectors over the individuals in the economy gives the supply vector, 

S = Eb‘- 

i 

We are assuming that none of the items are consumed or destroyed over time, 
so S is constant. Everyone is simply stockpiling their goods and desirous of 
a particular mix, depending on the going prices. If an individual wants more 
butter and less cashmere, she can sell some of her cashmere to someone else at 
the going price and purchase additional butter. However, if everyone is trying 
to get rid of cashmere, the price of cashmere is driven down. 

We denote the going prices per pound for cashmere, butter, and gunpowder 
by pc, Pb, and po respectively. We then represent this set of prices by a single 
price vector p = ( p c , Pb, Pg)- 

In fact, it is only the relative price of these items that matters. The particular 
cost of any item is irrelevant, since it is the ratio of costs that determines, for 
instance, how much butter can be purchased when a pound of cashmere is sold. 

Therefore we divide each of the individual prices by the sum of the prices 
Pc + Pb + Pg to “normalize” the situation. We denote the resultant prices by 



These normalized prices have the advantage of summing to 1. To simplify 
notation, we drop the primes on the price vectors, but keep in mind our as¬ 
sumption that the prices have been normalized. We then keep track of these 
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vectors in our overall price vector p = {pc, Pb, Pg) such that 

PC + PB + PG = 1. 

In addition, we assume that none of these items is so despised by a consumer 
that he might pay you to take it away. In other words, the prices are never 
negative, so 


Pc > 0 , pb > 0, and pc > 0. 

When the price of an item is 0, there is no demand for that item. Each 
consumer would rather have the other two items, no matter how expensive 
they are. 

We can graph the set of possible price vectors in 3-space. The equation 
Pc + PB + Pg — 1 defines a plane. That pc >0,ps >0, and pc > 0 limits 
us to that part of the plane that lies in the first octant of 3-space, yielding a 
triangle T, as shown in Figure 10.6. Each point in the triangle corresponds to 
a triple of prices, one for each of our three goods. It will be important for us, in 
applying the Brouwer Fixed Point Theorem, that T is topologically equivalent 
to a disk. 



FIGURE 10.6: The triangle T of possible price vectors ( pc, pb, Pg) for our three 
goods. 

Given a particular price vector p, each individual in the economy has a 
certain amount of “wealth” in the form of the value of his or her bundle of 
goods. For the rth consumer, this wealth is given by 

w'(P) = P-b‘. 

An individual’s wealth can change depending on the current price vector, so 
we represent it as a function of p. 

At a particular price vector p, individual f’s current bundle of goods may 
not be her optimal choice. She might want to trade certain goods for others. We 
keep track of her preferences for an optimal bundle at price p with a demand 
vector d' (p), which gives the combination of items she would like to have at 
these prices. We assume she spends all of her wealth in exchanging her bundle 
b 1 to obtain her desired bundle d'(p)- We express this with the following 
equation: 

w'(p) = p b' =p d'(p). 


(10.1) 
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EXAMPLE 10.5. Suppose Wilma has a pound of cashmere, a pound of butter, 
and three pounds of gunpowder. So she has a supply bundle b w = (1, 1, 3). At 
the price vector p = (i, |, |), she has a net worth of p • b w = 5/3. Further¬ 
more, assume that at this price vector, she would prefer a goods distribution as 
given by the demand vector d^p) = (2,2, 1). As required by Equation 10.1, 
this demand vector is constrained so that the total cost of her bundle at these 
prices, p • d^Cp), is equal to her net worth 5/3. 

On the other hand, suppose Dexter has bundle b D = (1,2,4), giving him 
a net worth of 5/2. For the same price vector p, he has a demand vector of 
d D (p) = (6, 2, |). Notice that Dexter seems to like cashmere more than Wilma 
since he wishes to convert proportionately more of his wealth to cashmere than 
she does. 

Now, if the prices change, the demand vectors can change as well. For 
example, if the price vector is p' = (■$)> to)> t * ien Wilma’s net worth 
increases to p' ■ b tv = 2. Seeing such a low price for butter, she may wish to 
stock up on it and choose a demand vector d w/ (p') = (1,11,1). Here too, her 
net worth does not change (it remains at 2), but she has converted much of it 
into a large supply of low-priced butter. 


Each consumer i has a vector-valued demand function d' which maps price 
vectors in T to the consumer’s corresponding demand vector. Adding together 
the individual demand vectors for all of the consumers yields the demand vector 
D(p) for the entire economy at the given price vector p. Thus, 

D(p) = £>(p). 

i 

We assume that this vector-valued function D is continuous in the sense that 
small changes in the price vector cause small changes in overall demand. 

Since p • d ! (p) is the net worth of individual i given price vector p, the dot 
product p • D(p) equals the total wealth of the community given those prices. 
Note that 


P D(P) = d '(P) = X^P b ' = P • S - 

i i 


This yields 


Walras’s Law: p • D(p) = p ■ S. 

This relationship is named for Leon Walras (1829-1910), one of the first 
economists to put the field on a mathematical footing. 

If the coordinate of D(p) corresponding to cashmere is greater than the 
coordinate of the supply vector S corresponding to cashmere, then at this price 
there is more demand for cashmere than there is supply. This will drive up the 
price of cashmere. 
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However, if at the current prices, everyone can obtain their optimal choice 
of goods and would not trade any of their current quantity of each item, then 
the economy is in equilibrium. 

Let Vj denote the jth component of a vector v. 

DEFINITION 10.7. A price vector p is an equilibrium price vector ifDj (p) < 
Sj for all j. 

For such a price vector, the supply of every item exceeds the demand. 
Everyone can obtain their optimal choice of goods. No one desires further 
exchanges. Although the demand for an item may be strictly less than the 
total supply of that item, this does not necessarily drive down the price of that 
item. Since the prices are relative and everyone may have enough of the other 
products, the price of the item in excess supply need not fall. 

Is there an equilibrium price vector? Can the economy be in balance? We 
will see that the answer is yes using the Brouwer Fixed Point Theorem. 

To do so, we define a function f : T -* T that maps the set of possible 
price vectors to itself and tells us how the demand for items causes prices to 
change. 

To see how prices change, we consider the following vector: 

DEFINITION 10.8. The excess demand vector is defined by E(p) = 
D(p) — S. 

The coordinates of the vector E(p) tell us whether or not there is more 
demand or more supply for the items at the prices in the vector p. If a coordinate 
of E(p) is positive, we expect the price of the corresponding item to rise since 
there is more demand than supply. People want more of this product at this 
price than is available. If the coordinate is negative, however, conditions could 
drive down the price of this item, as there is more supply than demand at this 
price. 

Using the excess demand vector, Walras’s Law can now be rewritten: 

Second form of Walras’s Law: p • E(p) = 0. (10.2) 

Given a price vector p, the vector E(p) points in the direction that we expect 
the prices to move from p. From it we can construct a function f that takes each 
price vector p in T to another price vector f (p) in T toward which we would 
expect the vector p to move. Hence we will have created a continuous map 
from T back to itself. The Two-Dimensional Brouwer Fixed Point Theorem 
tells us that this function must have a fixed point. In other words, there is a 
price vector p such that there is no incentive to trade goods when the prices are 
at p. At that price, the entire economy is in equilibrium. 

The rest of this section is devoted to making this idea mathematically 
precise by appropriately defining f : T -+ T and proving that a fixed point of 
f corresponds to an equilibrium price vector. 

We begin by defining a function f* : T -*■ R 3 by f*(p) = p + E(p). This 
function gives us an idea of how prices should move due to excess supply or 
demand. 
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However, there is a fundamental problem with this definition of f*. It does 
not necessarily send price vectors in T back to price vectors in T. The vectors 
that result might have negative entries, and their coordinates do not necessarily 
sum to 1. We need to remedy these problems so that we can be in a position to 
apply the Brouwer Fixed Point Theorem. 

In order to make all of the entries nonnegative, we define v + to be the 
vector obtained from a vector v by changing all negative entries into 0 entries. 
Then we define our new, improved function to be f**(p) = (p + E(p)) + . 
The function f** maps T to the first octant in M 3 , but f**(p) need not lie in 
T, since its entries do not necessarily sum to 1. However, by “normalizing” 
f**(p), which is to say, dividing the resulting vector by the sum of its entries, 
we obtain a vector in T. It is important to note that at least one coordinate of 
p + E(p) is positive, and therefore in normalizing f**(p) we are not dividing 
by 0. (See Exercise 10.15.) 

With (p + E(p))t representing the jth entry of (p + E(p)) + , we let fi (p) = 

]Cy=i(P + E(p))t, and we define our desired function, mapping T to itself, 
as follows: 


DEFINITION 10.9. The price change function f :T -»• T is defined by 


f(p) = 


(p + E(p))+ 

m 


Figure 10.7 depicts an example of the vectors p, E(p), p + E(p), 
(p + E(p)) + , and f (p) for a two-dimensional situation, as opposed to the three- 
dimensional one we have been considering. The second form of Walras’s Law 
implies that p and E(p) must be perpendicular. 



FIGURE 10.7: The vectors p, E(p), p + E(p), (p + E(p)) + , and f(p). 

The fact that D(p) is continuous implies that the price change function f (p) 
is also continuous. 

Now, we are ready to apply the Two-Dimensional Brouwer Fixed Point 
Theorem. Since f : T -* T is a continuous function and T is a disk, there 
must be a price vector p* such that f(p*) = p*. That is to say, there is some 
price vector that is fixed by f. We show, in fact, that such a vector is the 
equilibrium price vector we seek. 
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THEOREM 10.10. The fixed points of the price change function f are equi¬ 
librium price vectors. 

In order to prove this theorem, we need the following lemma: 

LEMMA 10.11. If p* is a fixed point off, then f(p*) = 1. 


Proof. Since p* is a fixed point of f, we have 


(p* + E(p* ))+ 
0(P*) 


f(p*) = 


= P* 


This implies that, for j = 1, 2, 3, 


(p* + E(p*))t = fi(p*)p*. 


(10.3) 


(10.4) 


Let a be an index value (equal to 1, 2, or 3) such that p* > 0. Such 
an a exists since the sum of the entries of p* is 1. Since f(p*) > 0 (by 
Exercise 10.15), it follows from Equation 10.4 that 

(p* + E(p*))+ >0. 

Therefore 

(p* + E(p*))+ = (p* + E(p*)) a . 

Plugging this back into Equation 10.4, we have 

(p* + E(p*))«' = fi(p*)Pa, 

which yields 

£*(p*) = (£ (P*) - 1 )P*. 

Multiplying both sides by p*, we obtain 

p*E a (p*) = - 1 )p*p*. (10.5) 


Although we derived Equation 10.5 with the assumption that p* > 0, 
the equation also holds when p* = 0 since it then reduces to 0 = 0. 
Therefore, since p*j is nonnegative, Equation 10.5 holds for all p*. Now, 
summing Equation 10.5 over all j, we obtain 

3 

p* • E(p*) = Y,P*Eji P*) 

;=i 

3 

= p *> - vpjpj 

7=1 

3 

= (j8(p*) - 1 yJ^PjPj 

7=1 

= 08(p*) - Dp* • p*- 
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By the second form of Walras’s Law (Equation 10.2), p* • E(p*) = 0. 
Therefore, 0 = (j8(p*) — l)p* ■ p*. Since p* € T, it follows that p* • p* = 
1. Thus, it must be that /8(p*) — 1=0, implying that £(p*) = 1, as 
desired. ■ 

With the help of Lemma 10.11, we can now prove Theorem 10.10. 

Proof of Theorem 10.10. Let p* be a fixed point for f. So f (p*) = p*. 
Since /3(p*) = 1, Equation 10.3 becomes 

(p* + E(p*)) + = p*, 

which is actually three equations of the form 

(p* + E(p*))+ = p*. (10.6) 

We consider two possibilities for p*j : either p* = 0 or p* > 0. Keep 
in mind that all prices must be nonnegative. 

If p* = 0, then (p* + E(p*))t = 0. Therefore Ej( p*)+ = 0, and it 
follows that Ej(p*) < 0. This means that there is no excess demand for 
item j in this case. 

Now, consider the case in which p* > 0. Then by Equation 10.6, 
(p* + E(p*))t > 0. Therefore, 

(p* + E(p*))t = (p* + E(p*)) j. 

Plugging this back into Equation 10.6, we obtain (p* + E(p*))y = p*, 
and therefore Ej( p*) = 0. 

Together, these cases imply that Ej(p*) < 0 for all j. Therefore 
Dj{ p*) < 5 j for all j, and p* is an equilibrium price vector. ■ 

When we have an equilibrium price vector p*, where Dj( p*) < Sj for all 
j, we say the markets clear. Everyone can achieve their demand vector. The 
Brouwer Fixed Point Theorem tells us there is such a price. So we can all sleep 
well at night. 

Nothing that we have said depended on the fact that we had three items 
available. The same would hold for an economy with hundreds of thousands 
of items and millions of individuals. Of course, establishing a corresponding 
result for a general setting would require the n-Dimensional Brouwer Fixed 
Point Theorem. 

Exercises for Section 10.2 

10.13. Describe the topological space of price vectors if the economy consists of only 
two items. Does the analysis of this section go through in that case? 

10.14. In an economy with four goods, the resultant set of normalized price vectors 
will yield the analog in 4-space of the triangle T we created in 3-space. What 
is it? How many faces does it have, and what are their equations? How many 
edges and vertices does it have? What are their equations? 
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10.15. Show that at least one coordinate of the vector p + E(p) is positive. 

10.16. Suppose that our entire economy consists of only Carmen and Dexter with 
initial bundles b c = (1, 1,2) and b° = (3,4, 2) and initial price vector p„ — 
(g, g, g). Let f be the price change function. 

(a) Assume that when the price vector is p = (g, |, g), Carmen has demand 
vector d c = (3,1,1) and Dexter has demand vector d° = ( 8 ,1,4). 
Determine f(p) for this p. 

(b) Assume that when the price vector is p' = ( 5 , 5 , 5 ), Carmen has demand 
vector d c = ( 5 , 5 , |) and Dexter has demand vector d D = (2,3,7). 
Determine f(p') for this p'. 


10.17. Let k be a fixed positive real number. Define 


ffe(P) = 


(p + fcE(p))+ 
E’=i(P + *E(p))+‘ 


Show that a fixed point p* of this general price change function also yields an 
equilibrium price vector. 


10.3 Kakutani’s Fixed Point Theorem 

In 1941, Shizuo Kakutani (1911-2004) proved a generalization of the Brouwer 
Fixed Point Theorem that has had powerful applications since. Instead of 
applying to functions from the n-ball B n to itself, Kakutani’s Fixed Point 
Theorem applies to so-called set-valued functions. Usually, functions associate 
a point x in a domain X to a point y in the range Y. Here, we will look at 
functions that take a point x in the domain X and send it to a nonempty subset 
A of the range Y. We denote such a set-valued function by / : X ->5 Y. (In 
this section we call our usual functions, whose values are points, point-valued 
functions, in order to distinguish them from set-valued functions.) 


EXAMPLE 10.6. Consider the set of all of the people who have ever lived. 
Let / assign to each person the set of all people that person has ever seen. This 
is an example of a set-valued function. To each point (person) the function / 
associates a set of points (the set of people that person has ever seen). 


EXAMPLE 10.7. The following are set-valued functions: 

(i) Let / assign to each real number x the set of all real numbers 
greater than x. We write f(x) = {y e E | y > x}. 

(ii) Let g assign to each real number x the set consisting of x and its 
negative. So g(x) = {— x, x}. 


(iii) Let h assign to each real number x the set [—1,4] if jc is negative 
or the set [—4, 1] if x is not negative. Therefore, 


h(x) = 


[-1.4] 
[-4, 1] 


if x < 0 , 
if x > 0 . 





Section 10.3 Kakutani’s Fixed Point Theorem 337 


For a point-valued function f : X -> Y, the graph is defined to be the 
set {(x, y) | y = /(x)} in X x Y. We need a similar notion for set-valued 
functions: 

DEFINITION 10.12. The graph of a set-valued function f : X F is the 
subset ofXxY given by G / = {(jc, y) | y G /(x)}. 


-1 

EXAMPLE 10.8. The graphs of the set-valued functions /, g, and h from 
Example 10.7 are shown in Figure 10.8. 



FIGURE 10.8: Graphs of the set-valued functions /, g, and h. 


In the hypotheses of the Brouwer Fixed Point Theorem, the continuity 
of the function is critical. For set-valued functions we do not directly define 
continuity, but instead we work with a property closely related to it. Combining 
the results of Exercises 4.10 and 7.13, it follows that if Y is compact and 
Hausdorff, then a point-valued function f : X Y is continuous if and only 
if the graph of / is a closed subset of X x Y . Therefore we focus on set-valued 
functions having graphs that are closed sets. In Example 10.8, the set-valued 
functions / and g have graphs that are closed sets, but the set-valued function 
h does not. 

Having a closed graph is advantageous since it implies that convergent 
sequences behave reasonably, as the following lemma indicates: 

LEMMA 10.13. Let f : X — Y be a set-valued function whose graph, Gf, 
is closed in X x Y. If (x n ) is a sequence in X that converges to xq g X, and 
(y n ) is a sequence in Y that converges to yo e Y and satisfies y n G f{x n ) for 
each n, then yo € /(xo). 

Proof. Form the sequence ((x, y)„) in X x Y defined by (x, y) n = 
(Xn, y„). Since y„ g f(x n ), this entire sequence lies in the graph of /. 
Since (x n ) converges to xo and (y„) converges to yo, ((x, y)„) converges 
to (xo, yo)- Now, either there exists N such that (xo, yo) = (xjv, yn), or 
(xo, yo) is distinct from every (x„, y n ). In the first case, it directly follows 
that (xo, yo) e Gf. In the second case, (xo, yo) must be a limit point 
of the set {( x n , y n )}ne z+ and therefore must also be a limit point of G /. 
Since Gf is closed, it follows that (xo, yo) e Gf in this case, as well. In 
either case, (xo, yo) is in the graph of /, and therefore yo e f (xo) as we 
wanted to show. B 
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Now, what does it mean for a set-valued function to have a fixed point? 

DEFINITION 10.14. Given a set-valued function F : X —>•$ X, a fixed point 
of F is a point x* in Xfor which x* e F(x*). 

A fixed point of a set-valued function is a point that maps to a set containing 
the point, in contrast to a fixed point of a point-valued function, which is a point 
that simply maps to itself. 

The functions considered in the Kakutani Fixed Point Theorem are assumed 
to have a domain that is a polyhedron in E", where a polyhedron is a bounded 
set that can be expressed as a solution set to finitely many inequalities of the 
form a\x\ + ... + a n x n < b. (See Definition 0.12.) 

The n -Dimensional Kakutani Fixed Point Theorem states that for a poly¬ 
hedron X in R' ! , a set-valued function F : X $ X has a fixed point if F (x) is 
a convex subset of X for each x in X and if the graph of F is closed in X x X. 

The n -Dimensional Kakutani Fixed Point Theorem requires the n -Dimen¬ 
sional Brouwer Fixed Point Theorem in its proof. Here we only address the 
Kakutani Fixed Point Theorem in dimensions one and two, but the proof di¬ 
rectly generalizes assuming the n-Dimensional Brouwer Fixed Point Theorem. 

For the one-dimensional case, the possibilities for the polyhedron X C E 
are limited. In fact, such sets must be closed and bounded intervals [a, b\. (See 
Exercise 10.20.) We ask you to prove the One-Dimensional Kakutani Fixed 
Point Theorem in Exercise 10.21. 

Now we present the Kakutani Fixed Point Theorem in dimension two: 

THEOREM 10.15. The Two-Dimensional Kakutani Fixed Point The¬ 
orem. Let X be a polyhedron inR 2 andF : X Xbe a set-valued function 

such that F(x) is a convex subset of X for each x in X. If the graph of F is 
closed in X x X, then there exists x* e X such that x* e F(x*). 

Proof A polyhedron in M 2 is either a point, a line segment, or a con¬ 
vex polygon. The cases for a point or line segment fall under the one¬ 
dimensional version of the theorem, assigned in Exercise 10.21. Here we 
address the case for a convex polygon in R 2 . For simplicity, we prove the 
theorem when A is a triangle T in R 2 . We then discuss how the method 
of proof for a triangle carries over to a general convex polygon. 

Let T be a triangle in the plane with vertices Vq, Vq, and Vq. (The 
reason for using the multiple indices will become clear as we progress.) 
Since T is a triangle, every point in T can be represented as a lin¬ 
ear combination of Vq, Vq, and Vq. Specifically, for x e T we have 
x = AqVq + AqVq + /.qVq, where each Aq > 0 and Aq + Aq + A ( 3 = 1. 
(See Figure 10.9.) 

Notice that T is homeomorphic to the disk, and therefore the Two- 
Dimensional Brouwer Fixed Point Theorem applies to every continuous 
function from T to T. 

To find a fixed point of F, we build a sequence fo, fi, f2,... of 
point-valued continuous functions that approximate F. By the Two- 
Dimensional Brouwer Fixed Point Theorem, each f„ has a fixed point 



Section 10.3 Kakutani's Fixed Point Theorem 339 



FIGURE 10.9; Every point in T is a linear combination of the vertices. 

x„ € T. We prove that the sequence of fixed points (x„) has a subse¬ 
quence that converges to a fixed point of F. 

For each of the three vertices Vq, i = 1, 2, 3, we pick a particular 
pointy^ € /(Vq). Then we define a function fo : T -» T sothatfoCvg) = 
yg on the vertices. We extend this linearly to the triangle by, for each 

X = XqVq + Ao v o + *o v o> settin g f o(x) = ^0 + + ^o- 

Notice that fo is not a set-valued function. It is a point-valued function 
mapping T to itself. Moreover, because fo is defined to be the linear 
combination of its values at the vertices, it is continuous. Therefore the 
Two-Dimensional Brouwer Fixed Point Theorem applies, and we have a 
point xo e T such that fo(xo) = xo. 

The point xo is not necessarily a fixed point of the set-valued function 
F (but would be, for instance, if it was one of the vertices of T). 

Now, to define the next function fi in our sequence of functions 
approximating F, we begin by subdividing T into four smaller triangles, 
as shown in Figure 10.10. 



FIGURE 10.10: Subdividing T into smaller triangles. 


The vertices of these four triangles are made up of the vertices of T 
and the midpoints of the edges of T, given by 


I V 1 _i_ i v 2 -Lv^ 4- iv^ and 4- lv^ 
2 V 0 ' 2 V 0> 2 V 0 + 2 V 0’ ana 2 V 0 ' 2 V 0’ 


We define fi (x) in a manner analogous to how we defined fo(x). For 
each vertex v in the four triangles, we choose a point y in F(v) and define 
fj (v) = y. Then, as with fo, we extend fi linearly over each triangle. 
Notice that if x lies in the intersection of two different triangles, then 
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the definition of fi (x) in terms of either triangle is the same because 
it depends only on the definition of fj on the two vertices that are the 
endpoints of the edge containing x. 

The function f[ maps T to itself and is continuous since it is a linear 
extension of the values at the vertices. Therefore, by the Brouwer Fixed 
Point Theorem, there is a fixed point X] of fi . Here too, xj is not neces¬ 
sarily a fixed point of the set-valued function F. But it would be if it was 
one of the vertices of the four triangles in the subdivision of T. Let T\ be 
a subdivision triangle that contains this new fixed point x\ , and assume 
that the vertices of T\ are vj, V|, and v 3 . 

We continue this process. Specifically, assume that we have a con¬ 
tinuous f„_i : T -* T with fixed point x„_i in triangle 7„_i c T having 
vertices \\ _ j, v 2 _ ,, and v 3 _,. 

To define f„, we subdivide each of the triangles used in the definition 
of f„_i into four subtriangles as described earlier. Then for each vertex v 
in each triangle, we define f„ (v) to be a point in F(v). Finally, we extend 
f„ linearly over each triangle in the subdivision to obtain a continuous 
function f„ : T -> T. Since f„ is continuous, it has a fixed point x„ e T. 
Let T n C T be a triangle in the subdivision containing x„, and assume 
T n has vertices v^, v 2 , and v 3 . 

We now have a sequence xo,xi,X 2 , ... offixed points of the functions 
f 0 ,fi,f 2 ,... • Note that x„ = + A 2 v 2 + A 3 v 3 for some values X ] n , 

X", and A 3 in [0, 1], Furthermore, if we have y J „ = f„ (v„) for each n and 
7, then x„ = f„(x„) = X ] n y l n + X 2 n y 2 n + A 3 y 3 as well, yielding 


x « = *Wn + + *«y l + ( 10 -7) 


Since T is a compact subset of the plane, Theorem 7.16 implies that 
every sequence in T has a convergent subsequence. Let x* be the limit 
of a convergent subsequence ( xj n ) of the sequence (x„). We show that 
x* is a fixed point of the set-valued function F. 

Because the side lengths of the triangles T n shrink to zero as n ap¬ 
proaches infinity, every sequence made up of one point from each triangle 
Tjn must also converge to x*. Hence, the sequence of fixed points (xj n ) 
and the corresponding sequences of vertices (vj n ), (v^), and all 
converge to x*. 

Since the interval [0, 1] is compact in E, Theorem 7.16 implies that 
every sequence in [0, 1] has a convergent subsequence. Thus the three co¬ 
efficient sequences (7*^), (,X 2 J, and (A.^) have convergent subsequences. 

Similarly, the three sequences (y*. ), (y? ), and (y^ ) are sequences in the 
compact set T and thus also have convergent subsequences. By taking 
subsequences, one at a time, of all of these additional sequences, we can 
thereby choose a single indexing sequence ( m n ) such that all ten corre¬ 
sponding subsequences converge: (x m „) to x*, (v^ n ) to x*, (v^ n ) to x*, 
(VmJ tox*, (X^J toavalue A 1 , (A^) to a value A 2 , (A 2 , ) to a value A 3 , 
(y m n ) to a point y 1 , (y 2 n ) to a point y 2 , and (y 3 , n ) to a point y 3 . 
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As m n approaches infinity, we see from Equation 10.7 that x* = 
A-'y 1 + A. 2 y 2 + A. 3 y 3 . Furthermore A. 1 + X 2 + A. 3 = 1, and A. 1 , A. 2 , A 3 e 
[0, 1]. Therefore x* is in the “triangle” with vertices y 1 , y 2 , and y 3 . (We 
use quotes since the triangle could be a line segment or a point if exactly 
two of the y‘ are equal or if they all are equal, respectively.) 

Since F has a closed graph in T x T, Lemma 10.13 applies. There¬ 
fore, since the sequences (v^ n ) and (y ] mn ) converge to x* and y 1 , respec¬ 
tively, and since y l mn e F(v^) for each m„, we know that y 1 e F(x*). 
Similarly, y 2 € F(x*) and y 3 e F(x*). 

But x* = A.y + A. 2 y 2 + A. 3 y 3 is in the triangle with vertices y 1 , y 2 , 
and y 3 , all of which lie in F(x*). Since F(x*') is convex, x* must also be 
contained in F(x*). In other words, x* 6 F(x*), as we wished to show. 

We have now proven the Two-Dimensional Kakutani Fixed Point 
Theorem assuming that the domain is a triangle in the plane. To prove 
the result for a general convex polygon in the plane, we use the same 
approach, but we start by subdividing the polygon into triangles. The 
initial approximating function fo is first defined on the vertices of these 
triangles and then is extended linearly to each of the triangles, just as was 
done in the foregoing process. Then, as was done previously, succes¬ 
sive approximating functions f„ are defined by subdividing each triangle 
considered at the previous stage, defining f„ on the vertices of the new 
triangles, and extending the definition linearly to each triangle. The same 
argument yields a point x* such that x* e F(x*). ■ 

This method of proving the Two-Dimensional Kakutani Fixed Point Theo¬ 
rem carries over to the general ^-dimensional version, with the ^-Dimensional 
Brouwer Fixed Point Theorem required along the way. 

Kakutani’s Fixed Point Theorem contains the Brouwer Fixed Point Theo¬ 
rem as a special case, but we need to view the domain of the Brouwer Fixed 
Point Theorem as a polyhedron to see this. Let 

P n = {(*i,... , x n ) e R" | - 1 < xj < 1 for; = 1,... ,n }. 

The set P n is an n-dimensional cube. By definition P n is a polyhedron, and it 
is homeomorphic to the n-ball, B n . Let us consider the Brouwer Fixed Point 
Theorem on P n . If f : P n -> P n is a continuous point-valued function, then 
f can be considered a set-valued function mapping x € P n to the single-point 
set (F(x)}. The domain of f is a compact polyhedron, and (f(x)} is a single¬ 
point set in P n and therefore is a nonempty convex subset of the domain. 
Furthermore, the point-valued function f is continuous; thus the graph of f is 
closed in P n x P n by Exercise 4.10. Kakutani’s Fixed Point Theorem then 
applies, implying that there exists x e F" such that x € {f (x)}, and therefore 
x = f (x). 

It is important to realize, however, that although the Brouwer Fixed Point 
Theorem is a special case of the Kakutani Fixed Point Theorem, the latter does 
not supplant the former, since the Brouwer Fixed Point Theorem is needed in 
the proof of the Kakutani Fixed Point Theorem. 
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Exercises for Section 10.3 

10.18. For each of the following set-valued functions, draw the graph and determine 
whether or not F(x) is convex for all values of x in the domain: 

(a) Let F : R R be given by F(x) = {y e R | y < x}. 

(b) Let G : R -*■$ R be given by G(x ) = {nx e R | n e Z). 

(c) Let H : R R 2 be given by H{x) = {(yi, yj) e R 2 I yiy 2 = x). 

10.19. Define a relation on R by a ~ b if there exists an integer c such that a, b € 
[c,c+ 1 ). 

(a) Show that ~ is an equivalence relation. 

(b) Let / : R —>■$ R, be the set-valued function defined by fix) = [x], where 
[x] is the equivalence class of x under the equivalence relation ~. Sketch 
the graph of /. 

(c) With / as defined in (b), is /(x) convex for each x e R? 

(In general an equivalence relation on a set X gives rise to a natural set-valued 
function, sending each point in X to the equivalence class of points in X that 
contains it.) 

10.20. Prove that a polyhedron in R must be a closed and bounded interval [a, b\. 

10.21. Explain the proof of the Kakutani Fixed Point Theorem in the case that X is 
the interval [0, 1]. Include pictures of n graphs of F, /o, and f\ and the fixed 
points xo and x\ in your explanation. 

10.22. Determine whether or not the Kakutani Fixed Point Theorem applies to each 
of the following set-valued functions. Note that you need to check that X is 
a polyhedron, that F(x) is convex for each x 6 X, and that the graph of F is 
closed. If Kakutani’s Fixed Point Theorem applies, find as many fixed points 
as you can for F. 

(a) X = [0,1] and F(x) = {y € [0,1] | y > 1 - x}. 

(b) X = [0,1] and F(x) = {y e [0,1] | £ is irrational). 

(c) X = [0, 1] and F(x) = {y € [0, 1] | y < x 2 }. 

(d) X — [0, 00 ) and F(x) = (ye [0, 00 ) | y > 2x). 

(e) X = [0, 1] U [2, 3] and 

f[2,3] if x e [0, 1], 

[10,1] if x € [2,3]. 

(f) X = [0, 1] x [0,1] and 

F(x, y) = {(x, y) e [0, 1] x [0, 1] | x > y and y = 1 — x). 


10.23. For each of the following hypotheses in the Kakutani Fixed Point Theorem, 
give an example to show that the theorem does not hold if the hypothesis is 
dropped. That is, find a set-valued function F : X ->5 X that satisfies each of 
the other two hypotheses, but does not have a fixed point. 

(a) X is a polyhedron. 

(b) F(x) is convex for each x e X. 

(c) G f is closed. 
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10.4 Game Theory and the Nash Equilibrium 

In this section we introduce game theory and provide an application of the 
Kakutani Fixed Point Theorem, using it to prove John Nash’s celebrated the¬ 
orem on the existence of equilibria in n-person games. Although all of what 
follows applies to n-person games, we will restrict ourselves to at most three 
players in order to keep the notation straightforward. 

Suppose that Elaine, George, and Newman agree to play a game. At a 
given turn, each of the three has a finite number of moves that she or he can 
make. For Elaine, these choices are labeled 1 through ri£. For George, they 
are labeled 1 through no- And for Newman, they are labeled 1 through ny. 
These moves are called pure strategies. 

We set the rules of our game so that all three players make a move without 
knowing what the other players have chosen to do. Then they each receive 
some payoff. When Elaine makes move i , George makes move j, and Newman 
makes move k. Consequently, Elaine receives payoff Eqk, George receives 
payoff Gijk, and Newman receives payoff N,jk- Each player has an associated 
n e x no x riN three-dimensional payoff array and all three are familiar with 
the three payoff arrays. 

Of course, if Elaine always made the same move, it would not take George 
and Newman long to decide how best to maximize their own payoffs relative 
to hers. So instead of consistently making move i, Elaine may choose to play a 
different strategy. She may choose to play each move with a certain probability. 

It makes good sense to do this. For instance, in a poker game, a player 
does not want to bluff every time certain cards are in her hand, as the other 
players may soon pick up on the fact that she consistently does this. Rather, 
she should bluff with a certain probability that she has predetermined. This 
will be her strategy. Then no particular card pattern will be associated with her 
bluffing. 

We denote the probability that Elaine plays move i by pt, that George 
plays move j by qj, and that Newman plays move k by r*. We make the basic 
probability assumptions that 

Pi > 0, qj > 0, and n > 0 for all i, j, k, (10.8) 

and 

Y^pt = i, Yl q j = and Yl rk = l - (10 - 9 ) 

i j k 

DEFINITION 10.16. The corresponding probability vectors 

P = (p i > pi. • • • > Phe )> q = (qi > q2. • • • > 

and r = {r\,r 2 ,... ,r„ N ) 
are called mixed strategies. 

Now, the payoff for each player becomes an expected value. For Elaine, it 
is given by 

E( p, q, r) = Y2 EijkPiqjn. 

ij,k 
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This expected value is the sum of the possible payoffs multiplied by the prob¬ 
ability that such a payoff occurs. We similarly define the expected values 
G(p, q, r) and N( p, q, r) for George and Newman, respectively. If Elaine, 
George, and Newman play their games enough times, we expect their pay¬ 
offs to average out to approximately f?(p, q, r), G(p, q, r), and N( p, q, r), 
respectively. 


EXAMPLE 10.9. Suppose Elaine, George, and Newman are deciding where 
to go for dinner. They can choose either the Happy Star Chinese Restaurant 
or the New Yorker Diner. All three agree to simultaneously yell out either 
“Chinese” or “diner.” If two of them agree and the third does not, they all go 
to the restaurant chosen by the two who agreed, and the third has to pay 10 
dollars to each of the others for their dinner. If all three agree, they go to the 
restaurant that they all chose, and everyone pays for their own dinner. 

Elaine’s payoff array then has entries of the form 


Eijk — 


-20 

if i # j = 

0 

if i = j = 

10 

otherwise. 


The payoff arrays for George and Newman are defined similarly. 

Let us assume that Elaine, George, and Newman are equally likely to pick 
the Chinese restaurant or to pick the diner on any given evening. Then their 
mixed strategies are all the same, namely p = q = r = (j, j). From this, we 
can compute the expected value of Elaine’s payoff. We find that 

£ = — 20 (±) + 0 (±) + 10 (±) = 0 . 

Similarly, the expected values for George and Newman are also both 0. 


Now, suppose that in a particular game, Elaine knows that George and 
Newman are using mixed strategies q and r, respectively. Elaine’s goal is to 
obtain the largest possible payoff; that is, she wants to maximize her expected 
value, given q and r. Thus, she wants to find a strategy p so that E (p, q, r) > 
E( p', q, r) over all possible probability vectors p'. There may be more than 
one such strategy for Elaine, and therefore we have the following definition: 


DEFINITION 10.17. Let P (q, r) be the collection of all probability vectors p 
that satisfy E (p, q, r) > £(p', q, r) over all mixed strategies p'. This is called 
the set of optimal mixed strategies associated to mixed strategies q and r. 


Similarly, <2(p, r) is the set of optimal mixed strategies for George, max¬ 
imizing his payoff when p and r are given, and R(p, q) is the corresponding 
set of optimal mixed strategies for Newman, given p and q. 
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Now, how does Elaine choose vectors to maximize the expected value 
E( p, q, r)? For a fixed q and r, the expected value is a linear equation in the 
components of p, as follows: 

£(p, q, r) = ]T Ei jk piqjr k = BE Eijkqj r k)Pi- (10.10) 

i,j,k i j,k 

Letting a,- = £ • k Eijkqjn, we can rewrite Equation 10.10 as 

E (p, q, r) = a\p\ + a 2 p2 + ... + a nE p nE . 

Once we have rewritten the expected value in this way, it becomes clear how 
to maximize it, as in the following example: 


EXAMPLE 10.10. Suppose that in a particular game played by Elaine, 
George, and Newman, there are five possible moves from which Elaine can 
choose. Further suppose that for the fixed mixed strategies q and r of George 
and Newman, respectively, The expected value of Elaine’s payoff is 

£(p, q, r) = 3pi + 5 p 2 + 4p 3 + 5p 4 + p 5 . 

She wants to find p = {p\, p2, pj, P 4 , ps) to maximize E (p, q, r). Clearly,the 
best choice for Elaine is to make the components of p with smaller coefficients 
as tiny as possible and those with the largest coefficients as big as possible. In 
this case, the largest coefficient is 5. So p\, pj, and ps should be chosen to be 
0. Since the coefficients of p 2 and p 4 are both 5, she could choose p 2 = 1 and 
P4 = 0. Or she could choose P 2=0 and p4 = 1 . In fact, she could choose p2 
and p4 to be any combination that adds to 1 . She will still have a probability 
vector that maximizes the payoff. Therefore she chooses her set of optimal 
mixed strategies to be 

P(q, r) = {(0, p 2 , 0, p 4 , 0)\p 2 + P4 = 1, P2 = 0,p4 = 0). 


j 


In general, we see that the set of optimal mixed strategies P (q, r) is given by 

P(q, r) = {p | T Pi = 1 and /?,• = 0 if a,- < max{a y }}. ( 10 . 11 ) 
i 1 

It follows that P( q, r) is a closed, bounded, and convex subset of R" £ . (See 
Exercise 10.26.) 

Now, if Elaine chooses an optimal mixed strategy p in response to the 
mixed strategies q and r of George and Newman, then George might want to 
change his mixed strategy to an optimal one, given that Elaine and Newman 
have chosen mixed strategies p and r. And, of course, Newman might want to 
change his mixed strategy as well. 

We would like to know if there is a choice of mixed strategies for all of the 
players such that every player is using an optimal mixed strategy at the same 
time. Under that circumstance, there would be no incentive for the players to 
change strategies. 
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DEFINITION 10.18. Mixed strategy vectors p, q, and r are said to solve the 
game if p e P(q, r), q 6 Q(p, r), and r e P(p, q). We then say that p, q, 
and r are a Nash equilibrium. 

In a Nash equilibrium, no player has an incentive to change his or her 
strategy, assuming no one else changes theirs. Each player is making as much 
as he or she possibly can, given the current strategies of the other players. 

In his doctoral dissertation at Princeton University in 1950, John Nash 
proved that every game has at least one equilibrium (which later became known 
as a Nash equilibrium). His results revolutionized the field of game theory and 
subsequently had a significant impact on economics and the social sciences. 
In 1994, the Nobel Prize in economics was awarded to John Nash, Reinhard 
Selten, and John C. Harsanyi, with the latter two receiving the award for their 
refinements to the Nash equilibrium theory for situations where multiple equi¬ 
libria exist and situations where each player does not have complete information 
about the other players’ strategies in a game. 

We present and prove Nash’s Theorem momentarily, but first we look at 
an example. 


EXAMPLE 10.11. Let us determine the Nash equilibria for the restaurant 
game of Example 10.9. We consider the situation from Elaine’s point of view. 
Let p = (a, 1 — a) be her mixed strategy. So a is the probability that she picks 
the Chinese restaurant, and I — a is the probability that she picks the diner. 
Similarly, let George’s mixed strategy be q = (b, 1 - b), and let Newman’s be 
r = (c, 1 — c). 

Then the expected value for Elaine’s payoff is 

E = I0[ab(l -c) + a(l - b)c + (1 - a)(l - b)c + (1 - a)b( 1 - c)] 

— 20 [a(l - b)( 1 - c) 4- (1 - a)(bc)], 

which simplifies to 


E — 10[2 a{b + c— \) + c-\-b — 3 cb]. 

For the time being, assume that b and c are fixed. Let us see how Elaine 
can maximize E by choosing an appropriate a. We consider three different 
possibilities for the sum b + c. First, if b+c > 1 , then Elaine obtains maximum 
E by choosing a — 1. Note that with a = 1, both a + b and a + c are greater 
than 1, since both b and c must be nonzero for b + c > 1 to hold. Then, with 
a + c > 1, by the same argument we used for Elaine, it follows that George 
can maximize his expected value by choosing b = 1 . Similarly, by choosing 
c = 1, Newman can maximize his expected value. Soa = & = c= lisa 
Nash equilibrium. This equilibrium corresponds to having everyone always 
choose the Chinese restaurant. 

In like fashion, it can be shown that if b + c < 1, then all three players 
will maximize their expected values with the choices a = b = c = 0. (See 
Exercise 10.27.) Therefore everyone always choosing the diner is a second 
Nash equilibrium. 
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The last possibility to consider is b + c — 1. In this case it does not matter 
what probability Elaine picks, as far as her expected value goes. But from the 
perspective of George and Newman, it does matter. If either a + b or a + c does 
not equal 1, then, as we saw earlier, the corresponding player will change his 
probability to 0 or 1, forcing the other players to change, and having everyone 
settle on one of the two equilibria already mentioned. The only case where this 
does not occur is if a + b = 1 ,a + c = 1, and b + c = 1. In this case we have 
a = b = j, and c = j. So this is a third Nash equilibrium; it corresponds 
to having each player choose each restaurant half of the time. 


Before we proceed with Nash’s Theorem, we review the components that 
lead up to it: 

(i) In an n-person game each player has a choice of moves to make, 
and each move is made without knowing the moves of the others. 

(ii) Associated to each player is a payoff array giving the payoffs the 
player receives for all of the possible outcomes associated with 
each player making a choice of move. 

(iii) Each player can choose a mixed strategy, and the combined choices 
of mixed strategies determine the average payoff, or expected value, 
for each player. 

(iv) Each player has a set of optimal mixed strategies for each choice of 
mixed strategies by the other players. Each optimal mixed strategy 
results in the maximum possible expected value for the player given 
the mixed strategies of the others. 

(v) A Nash equilibrium is a choice of mixed strategies that yields, for 
each player, the maximum possible expected value relative to the 
mixed strategies of the other players. 

THEOREM 10.19. Nash’s Theorem. There exists a Nash equilibrium 
for every n-person game. 

Proof. We present the proof for three-person games to keep the notation 
simple, but the same proof applies to n-person games. Given the three 
payoff arrays E\jk, Gijk, and Nijk, we need to demonstrate that there is 
a set of mixed strategies p*, q*, and r* that solves the game. 

Let m = n£ + nc + n#. For a choice of mixed strategy vectors 
p = (p i,... , p nE ), q = ( q\ ,... ,q„ G ), and r = (n, ... ,r„ N ), we 
define an m -vector by concatenating the components of these vectors, as 
follows: 

w = (p, q, r) = (p i,... ,p„ E ,q x ,... ,q na ,r u ... ,r„ N ). 

Each such vector w represents a combined choice of individual mixed 
strategies from each player. Its components must satisfy Inequalities 
10.8 and Equations 10.9. It follows that the set of possible vectors w is 
a polyhedron X in R m . 
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We define a set-valued function on X by 

F(p, q, r) = {(p', q', r') | p' € P(q, r), q' e Q( p, r), r' e F(p, q)}. 

In other words, the vector w = (p, q, r) is sent to the collection of 
vectors that have the property that their first n e components are an optimal 
strategy for Elaine, given that George and Newman stick with strategies 
q and r; their second tie components are an optimal strategy for George, 
given that Elaine and Newman stick with strategies p and r; and their last 
n n components are an optimal strategy for Newman, given that Elaine 
and George stick with strategies p and q. 

If we show that there is a w* such that w* e F(w*), then we will have 
proven the theorem, because such a vector w* = (p*, q*, r*) is made up 
ofthree vectors, p*,q*, and r* such that p* € P(q*,r*),q* € g(P*, r *)> 
and r* € F(p*, q*). Thus, we need to prove that there is a fixed point 
for the set-valued function F:X->-sX. 

We show that Kakutani’s Fixed Point Theorem applies to F. It was 
previously observed that X is a polyhedron in R m . Since each of F(q, r), 
<2(p, r), and R( p, q) is convex, it must be that F(w) is convex as well. 
(See Exercise 10.25.) Hence, we need only show that the graph G/r is 
a closed subset of X x X C R 2 " 1 . Let (xo, yo) be a limit point of Gf. 
For each positive integer i, pick a point (x,-, y, ) in the intersection of Gp 
with the ball of radius 1 /1 centered at (xo, yo). We obtain a sequence 
of points (Xj, y,-) in G F converging to (x 0 , yo). Let x,- = (p, , q,, r,), 
y i = (s;, t/, Uj)» x 0 = (Po, qo, r 0 ), and y 0 = (s 0 , to, u 0 ). We then have 
the following convergent sequences: (p, ) to po, (q, ) to qo, (r, ) to ro, (s,) 
to so, (tf) to to, and (u, ) to uo- 

Note that y,- e F(x,) for all i. Therefore s,• s F(q, , r,), t, e 
<2(Pi, r ( ), and u, € F(p,, q,). Hence, for all p', q', and r', we have 

£(s/,q,•,!•/) > F(p',q/,r/), 

G(Pi, t/, r/) > G (p,, q , r,), and 
N (p,•, q,, U;) > N(pi , q,-, r'). 

Since E, G, and N are continuous functions, these inequalities hold 
as we take the limit as i approaches infinity. (See Exercise 10.24.) This 
implies that 

E( s 0 , qo, r 0 ) > F(p', qo, r 0 ), 

G(p 0 , to, r 0 ) > G(p 0 , q', r 0 ), and 
N( po, qo, u 0 ) > N (po, qo, r'). 

Therefore s 0 € F(q 0 , r 0 ), to e g(po, r 0 ), and u 0 e F(p 0 ,qo). 
Thus, yo € F(xo), implying that (xo, yo) is in the graph of F. It follows 
that the graph of F is closed. The Kakutani Fixed Point Theorem now 
applies, and therefore there exists a w* such that w* e F(w*), as we 
wished to show. ■ 
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Thus, for our three-player game, we are assured that a Nash equilibrium 
exists, and therefore it is possible for each player to choose a mixed strategy 
that results in maximum expected value relative to the others’ choices of mixed 
strategy. 

Even though the previous proof involves a three-person game, we need the 
general version of the Kakutani Fixed Point Theorem to establish the existence 
of the Nash equilibrium. This is because it is the total number of plays that the 
players can make, rather than the total number of players, that determines the 
dimension of the space in which we are working. 

As we indicated earlier, these results carry over to n-person games for any 
positive integer n. The proof for the general case is essentially the same as the 
proof presented here, with the Kakutani Fixed Point Theorem establishing the 
existence of the Nash equilibrium. 


EXAMPLE 10.12. Here we look at an example of the classic prisoner’s 
dilemma two-person game. George and Newman have been charged with 
a crime, and each of them is interviewed separately about it. Since the prose¬ 
cutor does not have sufficient evidence for a conviction, George and Newman 
are each offered a minimal sentence for telling on the other. So, George and 
Newman can each either remain silent or tell on the other. 

If both George and Newman choose to be silent, each will receive a six- 
month sentence for a lesser crime. If they each tell on the other, then they each 
receive a reduced 24-month sentence because they assisted in the prosecution. 
If one tells while the other is silent, then the one who told receives a three- 
month sentence while the one who remained silent receives the full 72-month 
sentence. 

For each player, the four possible outcomes and the resulting consequences 
are displayed in the payoff matrices in Figure 10.11. The payoffs G, j and 
Nij in the matrices represent the sentences (in months) received by George 
and Newman, respectively, in each circumstance. Here, of course, the desire 
of each is to minimize their sentence. 


George George 

SILENT TELL SILENT TELL 


H 
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3 

Z 
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72 
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Id 
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1— 
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FIGURE 10.11: The prisoner's dilemma payoff matrices. 

It is straightforward to show that the only Nash equilibrium in this game cor¬ 
responds to having both players always tell on the other. (See Exercise 10.28.) 
Note, however, that the Nash equilibrium does not yield the best cooperative 
outcome of the game. If George and Newman agreed in advance to be silent, 
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they would receive a better result than the one in the Nash equilibrium. How¬ 
ever, since the game is not played cooperatively, for each player the threat of 
having the other player tell on them while they remain silent leads the player 
away from silence into the equilibrium. 


Exercises for Section 10.4 

10.24. Assume that / : X — > R is a continuous function, (*,•) converges to jco, and 
(y,) converges to y 0 - Show that if /(*, ) > f(y,) for all i, then f(x () ) > f(yo). 

10.25. Prove that if A is a convex subset of R" and B is a convex subset of lR m , then 
A x B is a convex subset of R” +m . 

10.26. Given that the set of optimal mixed strategies P(q, r) satisfies Equation 10.11, 
prove that P(q, r) is a closed, bounded, and convex subset of M" £ . 

10.27. Show that in Example 10.11, if we have a Nash equilibrium with b + c < 1, 
then a = b = c = 0 . 

10.28. Show that the only Nash equilibrium in the prisoner’s dilemma game in Exam¬ 
ple 10.12 corresponds to having both players always tell on the other. 

10.29. Elaine, George, and Newman are playing the same restaurant game as in Ex¬ 
ample 10.9. Suppose, however, that they have payoffs as follows: 

(i) If George and Newman agree, regardless of what Elaine calls out, 
they then go to the restaurant that George and Newman chose, and 
Elaine has to pay them three dollars each for coffee. 

(ii) If Elaine agrees with either George or Newman, but not both, they 
then go to the restaurant chosen by the two who agreed, and the 
third person pays 10 dollars to each of the others for dinner. 

(iii) If all three agree, they then go to the restaurant that they all chose, 
and George and Newman each pay Elaine five dollars toward her 
dinner. 

Find all Nash equilibria for this game. 

10.30. Elaine, George, and Newman can choose between the Happy Star Chinese 
Restaurant, Bobo’s Burgers, or Pat’s Pizza in the same game as in Example 10.9. 
Suppose that when two or three of them agree, the payout is the same as in the 
game in Example 10.9, and when all three disagree they all eat at home and 
there is no payout. 

(a) If all three use a mixed strategy vector ( 5 , 3 > 5 ) to determine which restau¬ 
rant to yell out, what are the resulting expected values? 

(b) Find all Nash equilibria for this game. 




Chapter 11 

Embeddings 


Along with the study of topological spaces, an important component of topol¬ 
ogy is understanding how topological spaces can sit within each other. Recall 
from Section 4.2 that we define an embedding of one topological space X in 
another topological space Y to be a homeomorphism from X to a subspace of 
Y. In this way, we make precise the idea of a copy of X sitting in Y. 

In this chapter we explore a variety of important embedding theorems 
in topology, starting with an overview of a number of interesting embedding 
questions and results in Section 11.1. In Section 11.2 we prove the Jordan 
Curve Theorem, a classic theorem in topology, which says that an embedded 
circle in the plane separates the plane into two components, each of which has 
the embedded circle as its boundary. In Section 11.3 we introduce a digital 
version of the Jordan Curve Theorem, and we discuss an application of it to 
digital image processing. 


11.1 Some Embedding Results 

In this section we consider two fundamental embedding questions: 

(i) Given topological spaces X and Y, is there an embedding of X in 7? 

(ii) If there is an embedding of X in Y, what can we say about how the 
image of the embedding of X sits in 7? 


DEFINITION 11.1. If f : X -*■ Y is an embedding of space X into space Y 
we say that X is embeddable in Y, or that we can embed X in Y. Furthermore, 
we call Y the embedding space and we call the image f(X) the embedded 
image of X. 

We introduced a result related to the first of these questions earlier in the 
book, without expressing it in terms of embeddings. When we introduced the 
Klein bottle in Section 3.4, we indicated that it does not exist in 3-space. That 
is to say, there is no embedding of the Klein bottle in R 3 . The same is true for 
the projective plane. We do not develop the tools to prove these nonembedding 
results in this book, but in this section we demonstrate that the Klein bottle 
is embeddable in M 4 , and we ask you to find an embedding of the projective 
plane in R 4 in Exercise 11.9. 

Let us begin by considering embeddings of the circle into topological 
spaces. Recall that the image of such an embedding is called a simple closed 
curve. Since S l is compact, it follows that simple closed curves are compact 
subsets of the embedding space. 
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There are no simple closed curves in R, and the only simple closed curve 
in the circle S 1 is the circle itself. (See Exercise 11.1.) 

We need another dimension in our embedding space for simple closed 
curves to be interesting, so let us consider simple closed curves in the plane. 
They can be straightforward, as we see on the left in Figure 11.1, or they can be 
quite complicated, winding throughout a region in the plane, as we see on the 
right. Despite this, simple closed curves in the plane have much in common, 
as asserted by the Jordan Curve Theorem. 



FIGURE 11.1: Simple closed curves in the plane. 


THEOREM 11.2. The Jordan Curve Theorem.. Let S be a simple 
closed curve in R 2 . Then R 2 — 5 consists of two components, and S is the 
boundary in R 2 of each of the components. 

This result was first presented by Camille Jordan (1838-1922) in 1887 in 
his work Cours d’Analyse de I’Ecole Poly technique. However, the proof he 
gave was incorrect. It was not until eighteen years later, in 1905, that Oswald 
Veblen (1880-1960) provided the first correct proof in [Veb], We prove the 
Jordan Curve Theorem in the next section. 

A simple closed curve in the plane is compact and therefore is closed and 
bounded. Since it is bounded, its complement in the plane has one unbounded 
component. (See Exercise 6.19.) Thus it follows from the Jordan Curve 
Theorem that the complement of a simple closed curve in the plane has one 
unbounded component and one bounded component. 


DEFINITION 11.3. Given a simple closed curve S in the plane, we call the 
bounded component o/R 2 — S the inside of S and the unbounded component 
the outside of S. 

It is an interesting consequence of the Jordan Curve Theorem that if we 
have two points in the complement of a simple closed curve S in the plane and 
we draw a line L from one point to the other, then if L crosses S an even number 
of times, the points are in the same component of the complement of S, and if 
L crosses S an odd number of times, the points are in different components of 
the complement of S. (See Figure 11.2.) 
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FIGURE 11.2: The points a and b are in the same component of the complement 
of S, but b and c are in different components. 


EXAMPLE 11.1. Consider the Jordan Curve Zoo in Figure 11.3. Is the 
zookeeper safe from the lion? Drawing a horizontal line L from the lion to 
the zookeeper, we see that L crosses the cage five times, and therefore the lion 
and the zookeeper are on opposite sides of the cage. If we consider the line L' 
running from the zookeeper to a point that we know is outside of the cage, we 
see that L' intersects the cage three times, and therefore we can conclude that 
the zookeeper is actually inside the cage. 



FIGURE 11.3: Is the zookeeper safe from the lion? 


j 


For simple closed curves in the plane, we can do better than the Jordan 
Curve Theorem. The following theorem indicates that not only is a simple 
closed curve homeomorphic to 5 1 , but the entire topological structure of a 
simple closed curve sitting in the plane is topologically equivalent to the circle 
S 1 sitting in the plane. 
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THEOREM 11.4. The Schonflies Theorem. Let f : S l R 2 be an 

embedding. Then there exists a homeomorphism F : R 2 —> R 2 such that 

F\ s i=f- 

This theorem is named after Arthur Schonflies (1853-1928), who first pre¬ 
sented it in 1906. We do not prove it; a proof can be found in [Moi]. The 
Schonflies Theorem states that every embedding of the circle into the plane 
extends to a homeomorphism of the whole plane to itself. (See Figure 11.4.) 
Such a homeomorphism must map the inside of S 1 in the plane to the inside of 
the simple closed curve /(S 1 ) and the outside of S 1 to the outside of /(S 1 ). 
(See Exercise 11.6.) Consequently, given a simple closed curve S in the plane, 
the inside of S is homeomorphic to the open ball B = {(x, y) | x 2 + y 2 < 1} 
as a subspace of the plane, and the outside of S is homeomorphic to the sub¬ 
space C = {(*, y) | x 2 + y 2 > 1}. Thus the Schonflies Theorem strengthens 
the Jordan Curve Theorem, indicating exactly (topologically) what the two 
components are that result from the simple closed curve separating the plane. 



FIGURE 11.4: The embedding f : S l -*■ R 2 extends to a homeomorphism F : 

K 2 K 2 . 


If we increase the dimension of the embedding space by one, and look at 
simple closed curves in R 3 , we find a very different situation. As indicated in 
Section 4.2, simple closed curves in R 3 are known as knots. The topological 
possibilities for knots are endless and are the subject of a very active area of 
study in topology. We explore knots further in Chapter 12. 

We can see from considerations of embeddings of S 1 into K 2 and into R 3 
that mathematics often presents a win-win situation. On the one hand we have a 
significant theorem (the Schonflies Theorem) indicating that all embeddings of 
S l into R 2 are essentially the same. On the other hand, it is not the case that all 
embeddings of S 1 into R 3 are the same, and therefore the task of determining 
all of the possibilities for these embeddings develops into an important area of 
mathematical research. 

Next, let us consider some of our familiar surfaces: the torus, sphere, Klein 
bottle, projective plane, and Mobius band. The sphere and torus were both 
initially defined as subspaces of 3-space, so they trivially embed in 3-space. 

The usual picture of the Mobius band illustrates an embedding in R 3 . In 
fact we can define a specific embedding of the Mobius band. We view the 
Mobius band as the rectangle /? = [0,2;r] x [— 1,1 ] with the pairs of points of 
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the form (0, a ) and (2n, —a) identified. Using (r, 9, z) cylindrical coordinates 
in R 3 , we define / : R -» R 3 by f(s, t) = (r(s, t), 9(s, t ), z(s, t )), where 

r(s, t) = 5 + t cos(i/2), 9(s, t) = 5, z(s, t) = t sin(s/2). 

Since /(0, a) = f(2n, —a), it follows that / defines a function /* map¬ 
ping the Mobius band to R 3 . The function f* is an embedding. It takes the 
core circle in the Mobius band, where t = 0, to the circle of radius five in the 
xy-plane. For each fixed s e [0, 2tt], f* maps the line segment { 5 } x [—1,1] 
to a line segment in the plane 9 = s. As s runs from 0 to 2 n, this segment es¬ 
sentially rotates halfway around, sweeping out the image of the Mobius band. 
(See Exercise 11.8.) 

Consider the Klein bottle and the projective plane. As we indicated previ¬ 
ously, neither can be embedded in M 3 . However, with just one more dimension 
at our disposal (that is, if we consider R 4 ), we can define embeddings. We 
show how for the Klein bottle; a similar approach can be used for the projective 
plane. (See Exercise 11.9.) 

The usual representation of the Klein bottle K in R 3 can be viewed as the 
image of a function / : K -» R 3 . (See Figure 11.5.) The function / is not an 
embedding because it is not one-to-one as a result of the self-intersection. We 
call this the standard M 3 representation of K. 




/ 


FIGURE 11.5: Mapping the Klein bottle into R 3 . 

In Figure 11.6 we depict the Klein Bottle as a square with its edges iden¬ 
tified. Within that depiction we have illustrated the set D that consists of the 
points on the Klein bottle where the self-intersection occurs in the standard 
R 3 representation (that is, the points in K where / fails to be one-to-one). 
We define a continuous function h : K -» R by choosing h to be constant 
on horizontal lines in the square, and by having h increase from 0 to 1 and 
then decrease back to 0 as we move vertically from the bottom to the top of 
the square. The idea is to have h(x 1 ) ^ h(x 2 ) if x\ and xi are points in D 
corresponding to the same point in the standard R 3 representation of K. The 
function h serves as one coordinate of the embedding of K into R 4 . 

Define a function F : K -» R 4 by having the first three coordinates of 
F(x) in R 4 equal f (x) and the last coordinate equal h(x). In this case, F is 
one-to-one because if x and y are such that x ^ y and / (jc) = /(y), then the 
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h = 0 
h= 1 

h =~2 

h = 0 



FIGURE 11.6: Defining h : K ->• M, one coordinate of an embedding of K into R 4 . 

function h is defined so that h{x) ^ h(y). Therefore for every x,y e K, if 
x ^ y then F(x) ^ F(y), and it follows that the function F is an embedding 
of K in R 4 . 

It is natural to ask if there are higher-dimensional analogs of the Jordan 
Curve Theorem and the Schonfiies Theorem. For example, does every embed¬ 
ded sphere in R 3 separate R 3 into two components with the sphere being the 
boundary of each (as in the Jordan Curve Theorem)? And, if so, are the two 
components topologically equivalent to the two components resulting from the 
standard embedding (as in the Schonfiies Theorem)? The answer to the first 
of these questions is yes, but to the second is no. 

First, let us address the affirmative answer. Not only does the Jordan Curve 
Theorem extend to embeddings of the sphere in R 3 , but it also generalizes to 
all dimensions: 

THEOREM 11.5. The Jordan-Brouwer Separation Theorem. Let 

S be an embedded image of S" in R" +1 . Then R ” +1 — S consists of two 
components, and S is the boundary in R ” +1 of each of the components. 

L. E. J. Brouwer proved this result in 1912. This theorem is straightforward 
to prove, given appropriate tools from algebraic topology. (See [Vic], for 
example.) A proof that does not require tools from algebraic topology can be 
found in [Dug]. 

The question of whether or not it is possible to generalize the Schonfiies 
Theorem to higher dimensions was not resolved until 1924, when J. W. Alexan¬ 
der (1888-1971) discovered a counterexample, now known as the Alexander 
Homed Sphere. The example consists of a sphere embedded in R 3 such that 
the outside of the sphere is not homeomorphic to the outside of S 2 in R 3 . 

The Alexander Homed Sphere is the embedded sphere A shown in Fig¬ 
ure 11.7. Between the two main horns seen in the figure, there is intricate detail 
that we describe and illustrate further in a moment. The loop L in Figure 11.7 
lies in the outside of A, but it cannot be deformed to a point in the outside of A 
(we discuss why shortly). Therefore the outside of A is not simply connected. 
On the other hand, the outside of S 2 is homeomorphic to R 3 — {O}, which is 
simply connected by Theorem 9.20. Thus, the embedding of S 2 to the Alexan¬ 
der Homed Sphere cannot be extended to a homeomorphism of R 3 to itself. It 
follows that the Schonfiies Theorem does not hold in dimension three. 
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To construct the Alexander Homed Sphere, we begin with a sphere and 
push out two horns as shown on the left in Figure 11.8. Next, in the second 
step in the construction, we push out a pair of hooked horns in the gap between 
the ends of the first two pushed-out horns, as shown on the right in the figure. 
The result is still an embedded sphere because there is a gap between the ends 
of each horn and they do not close up. 



FIGURE 11.8: Pushing out the Alexander Horned Sphere horns. 

We continue this construction process; as shown on the right in Figure 11.8, 
in each step we push out two smaller hooked horns in the gap between the ends 
of each pair of horns formed in the previous step. At the end of the nth step 
we have 2"~ 1 gaps, and therefore in the next step we push out 2" more hooked 
horns. The result, after carrying out this process for all n e Z+, is the Alexander 
Homed Sphere. 

Now, why is it that the loop L in Figure 11.7 cannot be deformed to a point 
in the outside of A? Roughly, the idea is as follows: If we could deform L to a 
point, then we would have a continuous function F mapping the disk D into the 
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outside of A that, when restricted to its boundary, S 1 , is an embedding of the 
circle onto L. Such a function would need to have the image of the disk pass 
between the hooked horns in A. Since D is compact, F(D ) is a compact subset 
of M 3 . Also, F{D) is disjoint from A, which is a compact set in R 3 . These 
two disjoint compact sets in R 3 must have a positive distance, say e, between 
them by Theorem 7.25. In the construction of A, the hooks eventually get to 
the point where the gaps between them are smaller than e, and therefore the 
image of F cannot get through those gaps to pass between the hooked horns. 

Exercises for Section 11.1 

11.1. (a) Prove that there is no embedding / : S 1 -*■ R. 

(b) Prove that an embedding / : S 1 -* S 1 must be surjective and therefore 
must be a homeomorphism. 

11.2. Prove that being embeddable into a particular space is a topological invariant. 
Specifically, prove that if Y is a topological space and X and X' are homeo- 
morphic, then X embeds in Y if and only if X’ embeds in Y. 

11.3. (a) Use the Schonflies Theorem to prove that every simple closed curve in 

the sphere separates the sphere into two components, each of which is 
homeomorphtc to an open disk. 

(b) On each of the torus, the Klein bottle, and the projective plane, sketch two 
simple closed curves, one that separates the space and one that does not. 

11.4. Prove the Schonflies Theorem for embeddings / : S° -*■ R. That is, prove 
that if / : S° -»• R is an embedding, then / extends to a homeomorphism 
F : R -> R. 

11.5. An embedding / : S 1 -> S 2 is called a star embedding if there exists a point 
p on the inside of f(S l ) and a continuous function r : S l -*■ (0, oo) such that, 
with polar coordinates centered at p, f can be expressed as /( 0 ) = (r(8), 6). 
(See Figure 11.9.) Prove the Schonflies Theorem for star embeddings. 



FIGURE 11.9: A star embedding. 

11.6. Show that if / : M 2 —»■ R 2 is a homeomorphism, then / maps the inside of S 1 
to the inside of the simple closed curve /(S 1 ) and maps the outside of S 1 to 
the outside of f(S l ). 

11.7. Show that the Schonflies Theorem implies that if / : D -» R 2 is an em¬ 
bedding, then R 2 — f(D) is connected. (Therefore, no subspace of the plane 
homeomorphic to a disk separates the plane.) 
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11.8. Illustrate the embedding of the Mobius band in R 3 given by the function /* 
described in this section. In particular, illustrate how the image of /* appears 
on the cylindrical-coordinate planes of fixed angle 9, as 9 runs from 0 to 2ir. 

11.9. Describe how the projective plane can be embedded in M 4 . 


11.2 The Jordan Curve Theorem 

In this section we prove the Jordan Curve Theorem. This intuitively straight¬ 
forward result is surprisingly difficult to prove. Many different proofs, with 
a variety of different approaches, have been published over the past century 
since the theorem was first proved by Veblen in 1905. The Jordan Curve 
Theorem may be the most-proved theorem in topology. The approach that 
we present uses the two-dimensional versions of the No Retraction Theorem 
and the Brouwer Fixed Point Theorem (which, as shown in Section 10.1, are 
equivalent theorems). It also uses Theorem 9.14, which asserts the existence 
of retractions from the disk onto arc subspaces. Our proof is based on the proof 
in [Mae]. 

Along with the proof of the Jordan Curve Theorem, we also present a 
nonseparation theorem (Theorem 11.10) indicating that no arc in the plane 
separates the plane. This result is essentially proven in the process of estab¬ 
lishing the Jordan Curve Theorem. 

Before we prove the Jordan Curve Theorem, we present a few results that 
we need in the proof. 

We begin with a theorem that states that if we have a rectangle with a path 
running from the left side to the right side and a path running from the bottom 
to the top, then the paths must have a point in common. The Two-Dimensional 
Brouwer Fixed-Point Theorem is used to prove this result. 

Previously, we defined a path in a topological space X to be a continuous 
function / : [0, 1] —>• X. Here, for convenience, we regard paths as having 
domain [—1, 1], rather than [0, 1]. This does not pose a problem since the 
spaces [0,1] and [—1, 1] are homeomorphic. As before, we also use the term 
path to refer to the image of the function that is the path. 

Consider the rectangle /? = [—l,l]x[—1,1]. Let B be the boundary of 
R as a subset of the plane; so B = {(s, t) € R | |s| = 1 or |/| = 1}. 

THEOREM 11.6. Let u, v : [—1, 1] -*■ R be paths in R such that u(— 1) = 
(—1,0), «(1) = (1,0), u(0) = (0, —1), and u(l) = (0, 1). Then there exist 
s,te [—1, 1] such that u(s) = v(t). (See Figure 11.10.) 

Proof. Assume that no such s and t exist, and therefore u(s) v(t) for 
alls,? e [—1, 1]. Expressing u(s) and v(t) in terms of their coordinates, 
we have «(i) = (niCs), U 2 (s)) and v(t) — (iq 00, V 2 (t)). For each 
s, t e [— 1,1], we define 

M(s, t) = max[|wi(s) - ui(f)l, l« 2 (s) - v 2 (t)\). 

Note that M(s, t) > 0 for all 5 and t. 
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FIGURE 11.10: Paths u and v in R must cross. 


Next, define F :/?-*/? by 


F(s, t) 


/ ni(r) - Ml(s) u 2 (s) - V2(t) \ 

V M(s,t ) ’ M(s,t) )■ 


The function F is continuous, and R is homeomorphic to the disk; 
therefore it follows by the Brouwer Fixed Point Theorem that F must 
have a fixed point (s*, t*) € R. Note that for each s, t e [—1, 1], either 


VI (Q - Ul(s ) 

M(s, t ) 


= 1 or 


U 2 (s) - v 2 (t) 
M(s,t) 


Therefore the image of F is a subset of B, the boundary of the rectangle 
R. Thus, the fixed point 0*, t*) must lie in B. We have four possibilities 
to consider: s* = — 1, s* = 1, t* = —1, or t* = 1. We claim that each 
of the four cases leads to a contradiction, and therefore there must be 
s,t € [—1, 1] such that u(s) = v(t). Here we derive a contradiction in the 
case that s* = — 1 . The remaining cases are addressed in Exercise 11 . 12 . 

Thus assumes* = — 1. Then v '* = — 1. But 


( Ml (s*), M 2 (s*)) = U(s*) = «(-l) = (-1, 0), 


and therefore ui(s*) — — 1. This implies that — > 0, which is 

a contradiction. ■ 


THEOREM 11.7. Let U be an open subset of the plane. Then U is connected 
if and only if it is path connected. 

Proof. By Theorem 6.28, path connectedness implies connectedness; 
consequently we only need to prove that if U is connected, then it is path 
connected. Therefore, suppose that U is connected. If U is empty, then 
the result trivially holds; thus assume that U is nonempty, and pick a 
point p e U. Let Y p be the set of points q 'mU such that there exists a 
path in U from p to q, and let N p be the set of points q in U such that no 
path exists in U from p to q. The sets Y p and N p are disjoint and their 
union is U. 
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We claim that Y p and N p are both open sets. To prove the claim, 
we prove that Y p is open; the proof for N p is similar. Suppose q e Y p . 
Then there is a path / in U from p to q. Since U is open, it follows that 
in the standard metric on R 2 there is an open ball B that is centered at 
q and contained in U. Pick a point q' e B. There is a path g in B (and 
therefore in U) from q to q'. Define a path h by having it follow / from 
p to q and then follow g from q to q'. The path h is a path in U from p 
to q'. Therefore every q' in B is also in Y p . Thus, for arbitrary q e Y p 
there is an open ball B such that q € B C Y p . It follows that Y p is an 
open set. 

Now, Y p and N p are disjoint open sets whose union is U. If Y p and 
N p were both nonempty, then they would form a separation of U, which 
is not possible since U is connected. Therefore either Y p or N p is empty. 
Since p € Y p , it must be N p that is empty. This implies that U = Y p , 
and thus there exists a path in U from p to every other point in U. Hence, 
U is path connected. ■ 


THEOREM 11.8. Let U be an open subset of the plane. Then the components 
ofU are open subsets of the plane. 

Proof. See Exercise 11.10. ■ 

By combining Theorems 11.7 and 11.8, it follows that if U is an open 
subset of the plane, then the components and path components of U coincide 
and are open subsets of the plane. 

The last theorem that we need before beginning our proof of the Jordan 
Curve Theorem is the following consequence of the Two-Dimensional No 
Retraction Theorem: 

THEOREM 11.9. Let D be the disk in the plane. Iff : D —► Discontinuous 
and f(x) = xfor all x € S 1 , then f is surjective. 

Proof. SeeExercisell.il. ■ 

Theorem 11.9 indicates that if / is a continuous function that maps the 
disk to itself and is the identity on its boundary—the circle S 1 —then / must 
map onto the disk. 

Now we are ready to prove the Jordan Curve Theorem, which states that if 
S is a simple closed curve in the plane, then R 2 — S has two components and 
S is the boundary of each component. 

We prove this in three parts: first we show that R 2 — S is not connected, 
then we show that S is the boundary of each component of R 2 — S, and finally 
we show that R 2 — S has two components. A few straightforward verifications 
in the proofs are addressed in the exercises. 

Proof (R 2 — Sis not connected). Let <J(x, y) : S x S -*■ R be the func¬ 
tion defining the distance between x and y in the standard metric on the 
plane. The domain of d is compact and d is continuous; therefore d takes 
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on a maximum value on S x S. It follows that there is at least one pair 
of points 51 , 52 in S that are a maximum distance apart. Let J be the 
line segment in the plane connecting 5i and 52 , and let Pi and P 2 be the 
lines perpendicular to J passing through 5i and 52 , respectively. (See 
Figure 11.11.) The points 5i and 52 are the only points of S lying on 
Pi U Pi, and the rest of S lies between Pi and Pi. (See Exercise 11.13.) 
Since S is compact and therefore bounded, there exist lines J\ and Ji, 
parallel to and equidistant from J, such that S — { 51 , 52 } lies in the interior 
of the bounded rectangle R' bordered by Pi, Pi, L\, and Li. 



FIGURE 11.11: The simple closed curve S lies in the rectangle bordered by P\, Pi, 
L\, and L 2 . 


By rotating, translating, and scaling we can obtain a homeomorphism 
h from the plane to itself, mapping R' to R = [—1,1] x [—1,1] and 
sending 51 and 52 to (—1,0) and (1,0), respectively. (See Figure 11.12.) 
Thus, it suffices to assume that S lies in the rectangle R and that S 
intersects the boundary of R only at the points 51 = (—1,0) and 52 = 
( 1 , 0 ). 



FIGURE 11.12: The rectangle R' maps homeomorphically to [-1,1] x [-1,1], 

Now, S forms two arcs in R, running from 5] = (—1,0) to 52 = 
(1,0). We denote these arcs by S' and S". Consider the vertical line seg¬ 
ment L in R connecting a — (0, — 1) to b = (0, 1). Using Theorem 11.6, 
we can conclude that L intersects both S' and S". Traveling from a to b 
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along L, we may assume, without loss of generality, that L intersects S' 
before it intersects S". Let a! be the first point of intersection of L with S' 
as we go from a to b along L, and let b' be the last point of intersection. 
(See Figure 11.13.) 



FIGURE 11.13: Going from a to b along L, the points a' and b' are the first and last 
intersections, respectively, with S'. 


We claim that L intersects S" between b' and b. If it did not, then we 
could take a path, denoted aLa'S'b'Lb, that goes from a to a' along L, 
then from a' to b' along S', and finally from b' to b along L. Such a path 
would go from a to b in R without intersecting S". (See Exercise 11.14.) 
This is impossible by Theorem 11.6. Therefore L intersects S" between 
b' and b. Now, as we go from b' to b along L, let a" be the first point of 
intersection between L and S" and let b" be the last. 

Consider the point c lying halfway between b' and a" on L. It follows 
that c lies on neither S' nor S". Therefore c is in the complement of S in 
R 2 . Let U be the component of R 2 — S that contains c; we claim that U 
is bounded. 

To show that U is bounded, suppose that it is not. By Theorems 11.7 
and 11.8, U is path connected, and since U is unbounded, we can find 
a path P in U from c to the outside of R. Let p' be the first point on P 
where P intersects the boundary of R. The point p' is in either the bottom 
half or the top half of the boundary of R. Suppose that p' is in the bottom 
half. Let Q be a path in the boundary of R running from a to p'. Using 
the same notational convention as above, a path aQp' PcLb goes from 
the bottom to the top of R and does not intersect S'. (See Exercise 11.14.) 
This contradicts Theoreirpl 1.6. Next suppose that p' is in the top half of 
the boundary of R. Let Q' be. a path in the boundary of R running from 
p' lob. Then a path aLa'S'b'LcPp' Q'b goes from the bottom to the top 
of R and does not intersect S”. (See Exercise 11.14.) This also yields 
a contradiction to Theorem 11.6. Thus, in either case— p' in the bottom 
half of the boundary of R, or p' in the top half of the boundary of R —we 
have derived a contradiction, and therefore U is bounded. 
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Since S is a bounded subset of the plane, it follows that R 2 — S has an 
unbounded component. Therefore R 2 — S has at least two components, 
one bounded and one unbounded, implying that R 2 — S is not connected. 


In the next part of the proof of the Jordan Curve Theorem, we show that S 
is the boundary of each component of R 2 — S. 


Proof (S is the boundary of each component of R 2 — S). To begin, 
note that Theorem 11.8 implies that each component of R 2 — S is an 
open set. Let V be such a component. Since the other components of 
R 2 — S are all open sets, it follows that V U S is closed and therefore 
C1(F) C f US. Now, Int( V) = V since V is open, and therefore 
C1(F) — Int(V) C S, implying that dV C S. Therefore, the bound¬ 
ary of each component of R 2 — S is a subset of S. We need to show 
that the boundary of each component of R 2 — S equals S. We do so by 
contradiction. 

Thus let V be a component of R 2 — S and assume that the boundary 
of V does not equal S. Therefore 3 V is a closed proper subset of S. It 
follows that there is an arc A in S containing 3 V. We consider the cases 
where V is bounded and where V is unbounded separately. 

First assume that V is bounded. Let B be a closed ball in the plane 
that is large enough to contain V U S in its interior. By Theorem 9.14, 
there is a retraction r : B A. Define / : B -* B by 


fix) = 


r(x) if x € C1(V), 
x if x e B — V. 


Note that C1(V) and B — V are closed subsets of B. Furthermore, 
C1(F) n {B — F) c A and r{x) = x for x e A. The Pasting Lemma 
implies that / is continuous. Now, / equals the identity when restricted 
to the boundary circle of B. Therefore Theorem 11.9 implies that / is 
surjective. However, it follows from the definition of / that no point in 
V is in the image of /, and thus we have a contradiction. 

Now assume that V is not bounded. Let B be a closed ball in the 
plane that is large enough to contain in its interior the simple closed curve 
S and all of the bounded components of R 2 — 5. By the first part of the 
proof, there is at least one bounded component of R 2 — 5; denote it by 
U. Again, by Theorem 9.14, there is a retraction r : B — A. Define 
g : B -> B by 


g(x) = 


x if x e C1(F) fi B, 

r(x) if x e B - V. 


The function g is continuous and equals the identity when restricted 
to the boundary circle of B. Theorem 11.9 implies that g is surjective, 
but it follows from the definition of g that no point in U is in the image 
of g. Therefore, we have a contradiction in this case as well. 

Thus S is the boundary of each component of R 2 — 5. ■ 
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In the last part of the proof of the Jordan Curve Theorem, we show that 
R 2 — S has two components. We use the same setup and notations as in the 
first part of the proof. 

Proof (R 2 — S has two components ). Let U be the bounded component 
of R 2 — S whose existence was established in the first part of the proof. 
Assume that there is another bounded component, W. We derive a con¬ 
tradiction that allows us to conclude that R 2 — S has one bounded com¬ 
ponent. Since R 2 — S also has one unbounded component, it then follows 
that R 2 — S has two components. 

Consider the arc A in R given by aLa' S'b' La" S"b" Lb. It runs from 
the bottom of R to the top and is disjoint from W. (See Exercise 11.15.) 
The points 5j = (—1,0) and S 2 = (1, 0) do not lie in A. Since A is 
closed, there exist open balls B\ and £2 in the plane, centered at 5] and 
52, respectively, that are disjoint from A. By the second part of the proof, 
the points si and S 2 lie in the boundary of W. Therefore there exist points 
w i and t 02 in B\ n W and B 2 H W, respectively. Let M be a path in B\ 
from 5i to to 1 , let M' be a path in W from w 1 to 102 , and let M" be 
a path in B 2 from 102 to 52- Then a path s\Mw\M'\U 2 M"s 2 is a path 
in R that goes from 5i to 52 and does not intersect A. This contradicts 
Theorem 11.6. Thus there is only one bounded component in R 2 — S, 
and the proof of the Jordan Curve Theorem is complete. fl 

The following is a nonseparation theorem that is closely related to the 
Jordan Curve Theorem. We ask you to prove it in Exercise 11.16. 

THEOREM 11.10. Let A be an arc in the plane. Then R 2 — A is connected, 
and A is the boundary of R 2 — A. 

Proof. See Exercise 11.16. B 

Exercises for Section 11.2 

11.10. Prove Theorem 11.8: Let U be an open subset of the plane. Then the compo¬ 
nents of U are open subsets of the plane. 

11.11. Prove Theorem 11.9: Let D be the disk in the plane. If /:£)-> D is 
continuous and f{x) = x for all x e S 1 , then / is surjective. (Hint: Use the 
No Retraction Theorem.) 

11.12. In the proof of Theorem 11.6, show that you arrive at a contradiction in each of 
the cases s* = 1, t* = —1, and r* = 1, for the coordinates of the fixed point 
(5*, t*) of the function F. 

11.13. For the Jordan Curve Theorem proof that R 2 — S is not connected, provide 
arguments for the claims that the points 5 ] and 52 are the only points of S lying 
on Pi U P 2 and that the rest of S lies between Pi and P2. 

11.14. For the Jordan Curve Theorem proof that R 2 — 5 is not connected, provide 
arguments for the following claims: 

(a) A path aLa'S'b’Lb does not intersect S". 

(b) A path aQp'PcLb does not intersect S'. 

(c) A path aLa'S'b'Lc Pp'Q'b does not intersect S". 
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11.15. For the Jordan Curve Theorem proof that R 2 — S consists of two components, 
provide the argument for the claim that the arc aLa!S'b'La"S"b"Lb is disjoint 
from W. 

11.16. Prove Theorem 11.10: Let A be an arc in the plane. Then R 2 — A is connected, 
and A is the boundary of R 2 — A. This is done in three parts: 

(a) Prove that if C is a closed subset of a topological space X , then the boundary 
of the complement of C is a subset of C. Use this to show that 3 (R 2 — A) c 
A. 

(b) Prove that R 2 — A is connected. (Hint: Use an argument similar to one 
used in the Jordan Curve Theorem proof that S is the boundary of each 
component of R 2 — 5.) 

(c) Prove that A c 3(R 2 — A). (Hint: Assume that A is not a subset of 
3(R 2 — A), and argue why there must be a point in A with an open-ball 
neighborhood that is disjoint from R 2 — A. Then, explain why that is an 
impossibility.) 


11.3 Digital Topology and Digital Image Processing 

Digital images have become one of the primary means for communicating 
visual information. The pictures in a digital camera, the illustrations in this 
book, the graph on a calculator screen, and the display on a ballpark scoreboard 
are examples of images that are digitally constructed or presented. The field of 
digital image processing is concerned with the creation, storage, manipulation, 
and presentation of digital images. In each of these aspects of digital image 
processing, there are questions and problems that involve topological concepts 
and tools. For example, 

(i) In creating a digital image, how can it be ensured that spatial re¬ 
lationships between real-world features are properly depicted in a 
digital representation of them? 

(ii) In storing a digital image, are there characteristics of the image’s 
structure that allow for a more efficient method than storing infor¬ 
mation about every individual pixel? 

(iii) In digitally transforming an image, how can the topological aspects 
of the image be preserved? 

In this section we consider question (ii) and present topological results that 
suggest an efficient storage process for digital images. These results include 
a digital version of the Jordan Curve Theorem. Our presentation is based on 
results from [Kha3]. 

The topological setting for this section is the digital plane, a space obtained 
by taking the product of two digital lines. We introduced the digital plane in 
Section 1.4, where we presented it as the topological space defined on Z x Z 
via the basis of sets 
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B(m, n) 


{( m,n )} 

{(m + a, n) \ a = — 1 , 0 , 1} 

{( m , n + b) | b — —1, 0, 1} 

{(m + a,n + b) \ a,b = —1, 0, 1} 


if m and n are odd, 
if m is even and n is odd, 
if m is odd and n is even, 
if m and n are even. 


We denote the digital plane by Z 2 . For (m, n ) e Z 2 , we refer to B(m, n) 
as the minimal basis element containing (m, n). We illustrate some of these 
basis elements in Figure 11.14. In the illustration we use different symbols for 
each of the different types of points in the digital plane. Points of the form 
( m , n) with both m and n odd are called open points. We denote them by 
O. Each of these points is a single-point open set in the digital plane. Points 
(m, n), with both m and n even, are called closed points. We denote the 
closed points by reflecting the fact that the minimal basis element for a 
point & contains the eight points surrounding it. Points (m, n), with one odd 
and one even coordinate, are called mixed points. We denote the mixed points 
by either 0 or ♦, with 0 reflecting the fact that the minimal basis element for 
a point 0 contains the points directly above and below it. A similar situation 
holds for the points denoted ♦. 


0 * 0 * 0 « 0 * 0 * 0«0 

0 * 0 * 040 * 0 * 0*0 



FIGURE 11.14: Basis elements in the digital plane. 

The digital plane can also be obtained as a quotient space of the plane 
R 2 with the standard topology. (See Exercise 3.32.) Therefore, since the 
plane is path connected and the digital plane is the continuous image of the 
plane under the corresponding quotient map, it follows that the digital plane 
is path connected. In fact, given two points in the digital plane, we can define 
a path / : [0, 1] —> Z 2 , as illustrated in Figure 11.15, that traces a route 
traveling vertically, horizontally, or both, from one point to the other. (See 
Exercise 11.17.) 

The subspace of the digital plane consisting of all of the open points is 
called the visible screen; we denote it by V. In our digital-image-processing 
model, the visible screen corresponds to the set of pixels in a digital image 
display. That is, it corresponds to what we actually see in a digital picture. The 
visible screen is a dense and open subset of the digital plane, and it inherits 
the discrete topology as a subspace. The inclusion of the closed points and 
the mixed points provides an invisible structure that connects the pixels and 
allows for the use of topological concepts and results in modeling and studying 
properties of digital images. 
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FIGURE 11.15: Given two points in the digital plane, there is a path between them. 

With the digital plane as the setting for our model, let us consider the 
storage problem previously introduced. In Figure 11.16, shaded in gray, we 
present sets of digital-plane points that enclose 1 x 1, 2 x 2, and 3x3 arrays 
of pixels in the visible screen. 

***#*#*#*# 
*o*o*o«o»o 


FIGURE 11.16: Digital-plane sets enclosing pixel arrays. 

The surrounding sets in Figure 11.16 are made up of 8, 16, and 24 points, 
respectively. In general, it can be shown that an n x n array of open points 
can be enclosed by a surrounding set of 8 n points in the digital plane. (See 
Exercise 11.23.) Therefore, if we want to store a digital image of a square 
blue area, 1000 x 1000 pixels in size, we can either store the location of each 
of the one-million blue-colored pixels, or we can store the location of the 
8000 surrounding points in the digital plane, indicating that the corresponding 
enclosed points in the visible screen are colored blue. Clearly, there is a storage 
savings in working with the surrounding set rather than the entire enclosed set. 

Now, to what extent does this principle carry over to more general digital 
images? We model a digital picture as a collection of regions such that in 
each region every point has the same color. We would like to know if we can 
replace the regions with surrounding sets that uniquely determine the regions. 
In this way we could reduce the space needed to store a picture by storing 
only information about the surrounding sets and the colors. Since the picture’s 
regions are uniquely determined by the surrounding sets, we could recover 
the picture from the stored information. The remainder of this section is an 
introduction to some digital-topology results that help answer this question. 
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The surrounding sets in Figure 11.16 each inherit a topology as a subspace 
of the digital plane. It is easy to see that each of these subspaces is a digital 
circle, as introduced in Section 3.3. Digital circles C n are defined for even 
integers n >4. They are obtained by taking the subspace {1,..., n + 1} of the 
digital line and gluing the ends 1 and n + 1 together. In Figure 31.17 we show 
the digital circles C 4 , Cg, and C$, along with a basis for each. 




FIGURE 1117: Digital circles C 4 , C 6 , and Cg. 


DEFINITION 11.11. Let X be a topological space. A digital simple closed 
curve in X is a subspace of X that is homeomorphic to a digital circle. 


-i 

EXAMPLE 11.2. The four-point set A in Fig. 11.18 is a digital simple closed 
curve, but the four-point set B is not. The subspace topology that B inherits 
from the digital plane is the discrete topology. 


O 

♦ 

O 

♦ 

o 



FIGURE 11.18: The set A is a digital simple closed curve, but the set B is not. 
_| 


The Jordan Curve Theorem indicates that every simple closed curve sepa¬ 
rates the plane, M 2 , into two components, each of which has the simple closed 
curve as its boundary. Therefore a simple closed curve S in the plane deter¬ 
mines two regions that are surrounded by S, and these regions each determine 
S since S is obtained by taking the boundary of each. This is the type of result 
that we seek for regions and surrounding sets in the digital plane. Is there a 
corresponding Jordan Curve Theorem for digital simple closed curves in the 
digital plane? We will see shortly that the answer is yes, but with a minor 
difference. 

Considering the three digital simple closed curves in Figure 11.16 and the 
single digital simple closed curve in Figure 11.18, it is straightforward to show 
that each of these sets separates the digital plane into two connected subsets. 
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Also, in the three examples in Figure 11.16, the digital simple closed curve is 
the boundary of each of the components into which the set separates the digital 
plane. In contrast, the digital simple closed curve A in Figure 11.18 is not 
the boundary of each component of its complement in the digital plane. (See 
Exercise 11.18.) 

It follows that we cannot attain a digital version of the Jordan Curve Theo¬ 
rem that corresponds exactly with the standard version. However, we have the 
following theorem: 

THEOREM 11.12. The Digital Jordan Curve Theorem. Let Abe a 

digital simple closed curve in the digital plane. Then A separates the digital 
plane into two components. Furthermore, A is the boundary of each of the 
components if and only if A is a closed subset of the digital plane. 

We do not present a proof of Theorem 11.12, although it is accessible 
with the tools and concepts presented up to this point in the text. A proof, 
by induction on the number of points in the digital simple closed curve, is 
presented in [Kis]. The initial case in the induction proof addresses four-point 
digital simple closed curves. That case is a consequence of the following 
theorem: 

THEOREM 11.13. Every four-point digital simple closed curve consists of 
points (m — 1, n), (m + 1, n), (m, n — 1), and ( m , n + 1), two of which are 
open points and two of which are closed points. 

Proof. Let A be a four-point digital simple closed curve. Using Theo¬ 
rem 1.13, it readily follows that the collection of sets 

{B(p,q) D A | {p,q) e A} 

is a basis for the subspace topology on A. Since A is homeomorphic 
to the digital circle C 4 , this basis must include two three-point sets that 
intersect in a pair of points. That can only happen if A contains two 
closed points (m — 1 , n) and (m + 1 , n), or ( m , n — 1 ) and (m, n + 1 ). 
The resulting three-point sets must then intersect in a pair of open points, 
either the points ( m,n — 1) and (m, n + 1 ), or the points (m — 1 , n) and 
(m + 1 , n), respectively. ■ 

It follows from Theorem 11.13 that a four-point digital simple closed curve 
must appear either as the set A in Figure 11.18 or as a 90-degree rotation of it. 
Now, suppose we have a four-point digital simple closed curve 

A = {(m — 1, n), (m + 1, n), ( m, n — 1), (m, n + 1)}. 

IfweletCi = {(m,n)}andC 2 = Z 2 — (AU{(m, n))), then it is straightforward 
to show that C\ and C 2 are the components of the complement of A in I?. This 
verifies Theorem 11.12 for four-point digital simple closed curves. 



Section 11.3 Digital Topology and Digital Image Processing 371 


The primary difference between the standard and digital versions of the 
Jordan Curve Theorem is that a simple closed curve in the standard plane is 
automatically the boundary of each component of its complement, while a dig¬ 
ital simple closed curve in the digital plane is the boundary of each component 
of its complement if and only if the digital simple closed curve is a closed 
set. It is noteworthy that a simple closed curve in the standard plane must be 
a closed set. (See Exercise 11.21.) Therefore, in the standard Jordan Curve 
Theorem it is not relevant to address simple closed curves that are not closed 
sets, in contrast to the Digital Jordan Curve Theorem where both possibilities 
for digital simple closed curves—being closed or not—are considered. 

The following theorem provides a simple condition for determining whether 
or not a digital simple closed curve is a closed subset of the digital plane. 

THEOREM 11.14. Let A be a digital simple closed curve in the digital plane. 
Then A is a closed subset of the digital plane if and only if A does not contain 
any open points. 

Proof. See Exercise 11.22. ■ 

For example, each of the digital simple closed curves in Figure 11.16 
consists of only closed points and mixed points. Therefore each is a closed 
subset of the digital plane. However, the digital simple closed curve A in 
Figure 11.18 contains two open points, and therefore A is not closed in the 
digital plane. 

Given a digital simple closed curve in the digital plane, its complement 
consists of two components. We next show that, as with the standard Jordan 
Curve Theorem, one of these components is bounded while the other is not. 
First, we define what we mean by bounded. 

DEFINITION 11.15. A subset A of the digital plane is bounded if there exists 
M e Z+ such that \m\ < M and \n\ < M for all (m, n) e A. Otherwise, A is 
said to be unbounded. 

In other words, a subset A of the digital plane is bounded if A is contained 
in a square centered at the origin in the digital plane. 

THEOREM 11.16. If A is a digital simple closed curve in the digital plane, 
then one of the components of its complement is bounded and the other is 
unbounded. 

Proof. Let A be a digital simple closed curve in the digital plane. Since 
A consists of only finitely many points, A is bounded. Therefore, at 
least one of the components of its complement must be unbounded. Let 
Me Z+ be such that \m\ < M and \n\ < M for all (m, n ) e A, and let 
B and B* be the sets defined by 

B = {(in, n) e Z 2 | \m\ < M and \n\ < M}, B* = Z 2 - B. 
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It is straightforward to show that B* is connected; furthermore, 
A D B* = 0 . Thus, B* is contained in one of the components of the 
complement of A. It follows that B contains the other component of the 
complement of A, and therefore that component is bounded. ■ 

If A is a digital simple closed curve in the digital plane, then we call the 
bounded component of the complement of A the inside of A and denote it by 
Ins(A), and we call the unbounded component of the complement of A the 
outside of A and denote it by Out(A). The Digital Jordan Curve Theorem 
indicates that A = 9(Ins(A)) and A = 3(Out(A)) if and only if A is a closed 
subset of the digital plane. 

Now, suppose that we have a digital image—an image displayed on an 
array of pixels. Associated to the image, we define a partition V of the visible 
screen V in the digital plane. Each set in the partition corresponds to an image 
region in which every point has the same color. As mentioned previously, 
we would like to know if it is possible to store the image as a collection of 
surrounding sets that determine the regions. Thus, we would like to know if 
there is a collection of digital simple closed curves that determines the sets in 
the partition V. 

The topology on V is the discrete topology, and therefore V is totally 
disconnected. However, in order to formulate our desired theorem, we need to 
introduce an alternative notion of connectivity for the visible screen. 

DEFINITION 11.17. 

(i) Let p = (m,n) be a point in the visible screen. The points 
(m — 2, n), (m + 2, n), (m,n — 2), and (m,n + 2) are said to 
be 4-adjacent to p. 

(ii) A set C in the visible screen is said to be 4-connected if for every 
pair of points p,q e C there exists a sequence of points in C, 
p = p\, pz ,... , p n = q, such that pi+\ is 4-adjacent to pi for 
all i = 1,... , n — 1. 


Given a point in the visible screen, the points that are 4-adjacent to it are the 
four points in the visible screen that are directly next to it, either horizontally 
or vertically. In Figure 11.19 we show two sets, A and B, in the visible screen. 
The set A is 4-connected, while the set B is not. 

Although we do not need it here, we can similarly define 8 -adjacent and 
8 -connected. In 8 -adjacency, all eight open points surrounding a particular 
open point are considered to be 8 -adjacent to it. The notions of 4- and 8 - 
connectivity are important in the formulation of visible-screen versions of 
topological properties. These connectivity properties played a major role in 
the development of the field of digital topology by Azriel Rosenfeld. (See 
[Ros], for example.) 

The next definition describes the process by which an image in the visible 
screen is converted to a collection of surrounding sets in the digital plane. 
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FIGURE 11.19: The set A is 4-connected, but the set B is not. 


DEFINITION 11.18. LetV be a partition of the visible screen into 4-connected 
subsets, only one of which is unbounded. The subset of the digital plane de¬ 
fined by 


Sv = U 3 (Cl(D)) 

DeV 

is called the cartoon determined by V. 

Given V as in the definition, the cartoon determined by V is a collection, 
S-p, of sets that are obtained by taking (in the digital plane) the boundary of the 
closure of each set in the partition. In [Kha3] it is shown that Sp is a union of 
digital simple closed curves in Z 2 — V, the complement of the visible screen 
in the digital plane. Thus, we can regard the cartoon determined by V as a 
collection of surrounding sets. The following theorem indicates, as desired, 
that the partition V can be recovered from the cartoon it determines: 


THEOREM 11.19. Let V be a partition of the visible screen into 4-connected 
subsets, at most one of which is unbounded. Let Sp be the cartoon determined 
by V. Then, the collection V*, of subsets ofV, given by 

V* = {C fl V | C is a component of 2’? — Sp}, 
is a partition of the visible screen, and V* = V. 

We do not provide a proof of Theorem 11.19; it follows from results pre¬ 
sented in [Kha3]. 

In Theorem 11.19 it is necessary to assume that the sets in the partition 
V are 4-connected in order to have the partitions V* and V coincide. For 
example, if the set B in Figure 11.19 is one of the sets in the partition V, then 
the two 4-connected “components” of B would be separate sets in V*. 

Summarizing, we have the following process for storing and recovering a 
digital image: 

(i) Given a digital image, define a partition of the visible screen by 
having each set in the partition correspond to a region in the image 
with a fixed color. 
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(ii) In order to store the image, construct the cartoon determined by the 
partition. (We can also store the color information by indicating 
which side of each digital simple closed curve in the cartoon has 
which color.) 


(iii) Recover the partition by taking the intersection of the visible screen 
with each component of the complement of the cartoon in the digital 
plane. 

The challenge of digitally representing the real world, which we usually 
regard as connected and continuous, requires having a means to “discretize” im¬ 
ages into digital versions, while simultaneously having a connectivity structure 
to maintain real-world topological relationships. The digital plane provides an 
effective model for this process; it is composed of the totally disconnected, 
open, and dense visible screen that is used for the representation of images, 
along with the closed points and mixed points that comprise the invisible struc¬ 
ture that provides connectivity. 

Euclidean space has been a primary venue for the discipline of topology, 
but as we continue to rely heavily on digital processes for communicating, 
presenting, and interpreting information, it will be important to include digital 
spaces among those that we investigate. 


Exercises for Section 11.3 


11.17. Given two points in the digital plane, show that there is a continuous function 
/ : [0,1] -» Z 2 that traces a route traveling vertically, horizontally, or both, 
from one point to the other. 

11.18. Determine the boundary of each component of the complement of the digital 
simple closed curve A in Example 11.2. 

11.19. (a) Consider the four six-point subsets of the digital plane shown in Fig¬ 

ure 11.20. Illustrate a basis for the subspace topology on each, and prove 
that none of the sets is a digital simple closed curve. 


* 


* 

* 



FIGURE 11.20: None of these six-point sets is a digital simple closed curve. 
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(b) Prove that there are no six-point digital simple closed curves in the digital 
plane. 

(c) Prove that for every even integer n > 8 there is an n-point digital simple 
closed curve in the digital plane. 

(d) Show that there is a six-point digital simple closed curve in digital 3- 
space—the space Z x Z x Z obtained by taking the product of three 
digital lines. 

11.20. (a) Show that the eight-point set surrounding an open point in the digital plane 

is a digital simple closed curve. 

(b) Show that the eight-point set surrounding a closed point in the digital plane 
is a digital simple closed curve. 

(c) Show that the eight-point set surrounding a mixed point in the digital plane 
is not a digital simple closed curve. Does such a set separate the digital 
plane? 

11.21. (a) Prove that if S is a simple closed curve in the plane, R 2 , then S is a closed 

subset of the plane. 

(b) Why does your proof in (a) fail to carry over to show that if A is a digital 
simple closed curve in the digital plane, then A is a closed subset of the 
digital plane? 

11.22. Prove Theorem 11.14: Let A be a digital simple closed curve in the digital 
plane. Then A is a closed subset of the digital plane if and only if A does not 
contain any open points. 

11.23. Let A be an n x n array of open points in the digital plane. Prove that 3(C1(A)) 
is a digital simple closed curve consisting of 8 n points. 
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Knots 


In this chapter, we present the theory of knots. Recall that a knot is an embed¬ 
ding / : S 1 -*■ R 3 of a circle into R 3 . We consider two knots to be equivalent 
if one can be deformed into the other without passing through itself, as in Fig¬ 
ure 12.1. Two knots that are equivalent are said to be of the same knot type. 
We give a rigorous.definition of this equivalence shortly. 




FIGURE 12.1: These two knots are equivalent. 

Knot theory was bom in the mid-nineteenth century. Carl Friedrich Gauss 
(1777-1855) and his student Johann Listing (1808-1882) both contributed to 
the early study of knot theory. Listing’s work, “Vorstudien zur Topologie” 
(where the term “topology” was first used), included a discussion about the 
problem of classifying knots. 

Throughout most of the nineteenth century scientists believed that there 
was an invisible substance called ether that pervaded all of space. The Scottish 
physicist William Thomson (1824-1907), also known as Lord Kelvin, posited 
that atoms were simply knotted vortices in the ether (like the smoke rings one 
can blow into the air). Each distinct knot was thought to correspond to a distinct 
element. 

Motivated by Thompson’s conjecture, James Clerk Maxwell (1831-1879) 
and Peter Guthrie Tait(1831-1901)studied properties of knots, with Tait work¬ 
ing on the development of a catalog of different knots. Ultimately, the list of 
knots did not correlate well with the list of elements. And, with the Michelson- 
Morley experiment (1887), it became apparent that there was no ether, and thus 
a death blow was dealt to this model of the atom. But by then, there was suffi¬ 
cient interest in cataloging knots, solely from a mathematical perspective, that 
work continued in this direction. 

Following the development of the foundations of topology in the early 
twentieth century, mathematicians were able to formalize the theory of knots. 
Since then, many mathematical tools have been developed and applied to knot 
theory, one of the most active areas of topology. 

The most fundamental question in knot theory is how to tell whether or not 
two knots are equivalent. Given pictures of two knots, we wish to determine 
whether or not it is possible, when the knots are tied in a string, to rearrange 
one of the knots to look like the other. 


376 
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One way to prove that two knots are not equivalent is to use quantities, 
called invariants, that we associate to knots. Invariants depend only on the 
type of the knot, and not on a particular picture of it. When we have two knots 
that have different values of an invariant, we know that the knots are distinct. 

In Section 12.1, we introduce isotopy and use it to give a rigorous definition 
of knot equivalence. In Section 12.2, we introduce the Reidemeister moves, 
which are moves we perform on a knot projection (a picture of a knot) that 
preserve the type of the knot. Reidemeister moves provide us with a straightfor¬ 
ward means for proving that the invariants we define actually are independent 
of the type of the knot. In this section, we also introduce linking number, an 
invariant for distinguishing so-called links, which are configurations made up 
of a collection of knots. In Section 12.3, we introduce polynomial invariants 
for knots, an innovation dating from the 1920s and 1930s, but which expanded 
dramatically in the 1980s. In the last section, we introduce some recent appli¬ 
cations of knots to the study of DNA and to synthetic chemistry. 

12.1 Isotopy and Knots 

In Section 9.1, we introduced the concept of a homotopy F : X x I -*■ Y 
between two continuous functions / : X -» Y and g : X -*■ Y to be a 
continuous function F(x, t ) such that F(x, 0) = f(x) and F(x, 1) = g(x). 
As the coordinate t varies from 0 to 1, the function F transforms / into g. In 
knot theory, we are interested in a particular type of homotopy. 


DEFINITION 12.1. A homotopy F : X x I Y is called an isotopy if 
F|xx)f| is a homeomorphism for all t in /. 

In an isotopy F : X x I Y, as t varies, we think of the isotopy as a 
one-parameter family of homeomorphisms F|x x {/) defined on X. 


EXAMPLE 12.1. Define F : R 2 x / ->■ R 2 by F{x, t) = (t + l)x. When 
t = 0, F is just the identity map on R 2 . But as t grows from 0 to 1, each vector 
in R 2 beginning at the origin is stretched in length until the end of the isotopy, 
when they are all twice as long as they originally were. (See Figure 12.2.) 



FIGURE 12.2: The isotopy F stretches each vector to twice its length. 

_____ | 
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We can think of an isotopy as a deformation of a space X over time that 
does not change the topology of X. We are particularly interested in whether 
or not two embeddings of a space Y in another space X can be deformed to 
each other by an isotopy of the space X containing them. In this way, we define 
what it means for two embeddings to be equivalent. 

DEFINITION 12.2. If f : Y -> X and g : Y -> X are embeddings ofY 
into X, then we say that f and g are ambient isotopic if there is an isotopy 
F : X x / —► X such that F(x, 0) = xfor all x e X and F(f(y), 1) = g(y) 
for all y e Y. The space X is called the ambient space and the function F is 
called an ambient isotopy. 

If F : X x / -» X is an ambient isotopy, then when restricted to t = 0, 
F is just the identity function on X, which fixes all points in X. But when 
restricted to t = 1, F sends the image of a point under / to exactly the same 
place as the image of the point under g. The ambient isotopy deforms the entire 
space X in a continious manner, so that by the end the function / has been 
deformed to the function g. 

Notice also that although the conditions F(x, 0) = x and F( f(y), 1) = 
g(y) appear quite different, the first condition does imply that F(f(y), 0) = 
fiy). So we can think of these conditions as implying that at time t = 0, F 
sends f(y) to fiy) and at time t = 1, F sends f(y) to g(y). 

Although complicated to state, the definition of an ambient isotopy captures 
the true spirit of rubber-sheet geometry. The image of the first embedding is 
slowly transformed to the image of the second as the ambient space is deformed. 
Notice that the image of the first embedding is, by definition, homeomorphic to 
the second. At each stage, a homeomorphic copy of Y appears in the ambient 
space, and we watch a movie of it as it is transformed from the initial version to 
the final version. (See Figure 12.3.) Ambient isotopy defines an equivalence 
relation between embeddings of a space Y into a space X. (See Exercise 12.3.) 



FIGURE 12.3: An isotopy resembles a movie of the deformation. 


EXAMPLE 12.2. Define / : S 1 -> R 2 by fid) = (cos(0), sin(0)). Thisem- 
beds the circle onto a copy of itself in R 2 . Define g(9) — (2 cos(0), 2 sin(0)). 
This embeds the circle as the circle of radius 2 in R 2 , centered at the origin. It 
seems clear that we can deform the first embedding to the second embedding. 
(See Figure 12.4.) But what is an explicit ambient isotopy? We can use the 
isotopy from the previous example. That is, take F : R 2 x / -> R 2 given by 
F(x, t) = it + l)x. This is an isotopy since for a fixed value t in /, F| R 2 X {,j 
is a homeomorphism of the plane to itself. 
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Moreover, when t = 0, F(x, 0) = x, so it is the identity map. And finally, 
when we restrict to t = 1, we have that 

F(/(0), 1) = (1 + 1 )/(0) = 2/(0) = g(6). 

Therefore the two embeddings of the circle are, in fact, ambient isotopic. 



FIGURE 12.4: An isotopy deforming the circle of radius 1 to the circle of radius 2. 


EXAMPLE 12.3. There is an ambient isotopy F : M 3 x / -» R 3 that takes 
the sphere S 2 and deforms it to the embedded sphere S depicted in Figure 12.5. 


>► 


/ 

v' 



FIGURE 12.5: The sphere S 2 can be deformed to the embedded sphere 5 by an 
ambient isotopy. 

The function F| R 3 x(0 ) is the identity map, which is a homeomorphism. 
The function F| K 3 x ji) sends the sphere to the embedded sphere S. We do not 
give an explicit formula for F. Often it is difficult or impossible to determine 
a formula for a specific ambient isotopy, although it is apparent from pictures 
that one exists. 



We now wish to apply the concept of ambient isotopy to knots. In order to 
avoid certain pathological situations, we assume that the knot can be realized 
by a finite number of sticks glued end-to-end, as in Figure 12.6. We call this a 
polygonal knot. We call the sticks edges of the polygonal knot, any adjacent 
pair of which meet at a vertex. 

Every smooth knot can be approximated arbitrarily closely by a polygonal 
knot by usjng many short sticks, so there is no loss in considering polygonal 
knots rather than smooth knots. 
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FIGURE 12.6: A knot made from sticks. 

We use finitely many sticks so that we can avoid so-called wildly embedded 
knots, such as the one appearing in Figure 12.7. These knots have strange 
behavior that belies the intuition we are attempting to capture with knots. 



FIGURE 12.7: A wildly embedded knot, which we exclude from consideration. 

Again, the mathematical field of knot theory investigates the various em¬ 
beddings of the circle S 1 into R 3 . We would like two distinct knots to be 
equivalent if we can deform one to the other without passing it through itself. 
How can we make that relation rigorous? 

DEFINITION 12.3. Two knots /, g : S l —> M 3 are equivalent if they are 
ambient isotopic. A collection of equivalent knots is called a knot type. 

Let H : R 3 x / -» R 3 be the ambient isotopy between knots given by the 
embeddings /, g : S 1 R 3 . The condition that H(x, 0) = x ensures that H 
starts as the identity map. The condition that H(f(x), 1) = g(x) ensures that 
by the time it is done, H has deformed the knot given by / to the knot given 
by g. The fact that the isotopy must be a homeomorphism for any fixed value 
of the second coordinate t prevents the knot from passing through itself during 
the deformation. 


EXAMPLE 12.4. All of the knots illustrated in Figure 12.8 are of the same 
knot type, since any one of them can be deformed to each of the others by 
ambient isotopies of R 3 . 



FIGURE 12.8: These knots are of the same knot type. 


j 
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The reason we use an ambient isotopy of the surrounding space R 3 is to 
avoid deformations that collapse a part of a knot away, as occurs in Figure 12.9. 
In that deformation, a part of R 3 is collapsed down to a point at the last stage, 
and that cannot occur in an ambient isotopy of R 3 . 



FIGURE 12.9: We do not allow collapsing. 

Typically, we picture an ambient isotopy by visualizing how a knot is being 
deformed through space, keeping in mind that throughout the deformation, the 
whole ambient space is being deformed along with the knot, and therefore the 
knot cannot pass through itself. (See Figure 12.10.) 



FIGURE 12.10: A sequence of deformations that shows that the initial and final 
knots are equivalent. 

We also assume that the isotopy is what is called piecewise-linear. This 
is to avoid the possibility of passing through a wild knot in the process of 
deforming one knot to another. A piecewise-linear ambient isotopy from one 
polygonal knot to another can be realized by a sequence of the following two 
operations on the knot (see Figure 12.11): 

(i) Replace an edge of the knot with two other edges, such that the 
three edges bound a triangle that intersects the knot along just the 
first edge. 

(ii) Replace two adjacent edges of the knot by one edge, such that the 
three edges bound a triangle that intersects the knot in exactly the 
first two edges. 
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FIGURE 12.11: The operations on a knot in a piecewise-linear ambient isotopy. 

We refer to these operations as triangle moves on the knot. Each of these 
moves clearly preserves the knot type, since it can be realized as an ambient 
isotopy deforming the knot continuously across the triangle to the other edge 
or edges, as illustrated in Figure 12.12. 



FIGURE 12.12: The triangle moves result from an ambient isotopy. 

Although it is worth keeping in mind that all knots should be constructed 
from finitely many line segments, we do not dwell on this, and we leave it 
to you to convince yourself that the smooth knots pictured and the ambient 
isotopies that we describe can be realized by piecewise-linear versions. 

It is convenient to use the term knot when referring to the entire equivalence 
class of a knot. For instance, we say that two knots are distinct when they are 
not in the same equivalence class. We say that a knot is trivial when it is in 
the same equivalence class as the knot given by the embedding that sends S 1 
onto the circle of radius 1, centered at the origin, in the xy-plane in R 3 . Only 
when confusion could arise do we revert to referring to a knot type. 

To keep track of knots and the operations we perform on them, we use 
pictures of knots called knot projections. Given a polygonal knot in 3-space, 
we can project it to a closed polygonal curve in a plane. The resulting planar 
curve is called a projection of the knot. A projection is called a regular 
projection if 

(i) No point in the projection corresponds to more than two points on 
the knot; 

(ii) There are only finitely many points in the projection that correspond 
to two points on the knot. These are called double points of the 
projection; 

(iii) No double point corresponds to a vertex of the knot. 

In Figure 12.13, we illustrate several situations causing a projection to fail 
to be a regular projection. 
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FIGURE 12.13: Projections that do not result in a regular projection. 


If a projection of a knot is not regular, we can move the knot just slightly 
before projecting to make the projection regular. We call this putting the knot 
in general position relative to the projection, and although we do not go into 
the technical details here, it is not hard to convince yourself that we can always 
do this. For instance, if there is a vertex projecting to a double point, turning 
the knot in space just slightly will move the vertex off the double point. This 
slight movement of the knot does not change the knot type. Hence, every knot 
has a regular projection. 

A regular projection of a knot appears as a topological graph in the plane. 
(See Figure 12.14.) The projected vertices of the knot, together with the double 
points of the projection, make up the vertices of the topological graph. If the 
projection of an edge of the knot has no double points in it, then it is an edge of 
this topological graph. Otherwise, the projected edge of the knot is subdivided 
into new edges by the vertices corresponding to the double points. 



FIGURE 12.14: A regular projection appears as a topological graph in the plane. 

A projection of a knot does not carry as much information as does the orig¬ 
inal knot. At a double point, we do not know which of the two corresponding 
strands in the knot is closer to the projection plane. However, a knot projec¬ 
tion is a regular projection that, at each double point, includes an indication of 
which strand is crossing over the other strand, relative to the projection plane, 
as in Figure 12.15. Knot projections are also referred to as diagrams. Given a 
knot projection, we can construct a knot in 3-space that is represented by that 
knot projection. Furthermore, any two such knots that we construct from a 
given knot projection are equivalent. 
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FIGURE 12.15: A knot projection. 

We call the double points in a knot projection the crossings of the projec¬ 
tion. 

A planar isotopy is a piecewise-linear isotopy of the plane. It deforms a 
polygonal knot projection to another polygonal knot projection without chang¬ 
ing the structure of the associated topological graph in the plane. Although a 
planar isotopy can stretch and shrink the distances between crossings, it cannot 
change the number of crossings and which crossings are connected by which 
edges of the projection. 

If we have a planar isotopy from one knot projection to another, then we 
can construct an appropriate ambient isotopy of R 3 between the corresponding 
knots, and therefore the knots are equivalent. On the other hand, an ambient 
isotopy between two knots does not necessarily project to a planar isotopy. For 
example, in a projection of an ambient isotopy we could see a deformation as 
illustrated in Figure 12.16. At the third stage we do not have a knot projection. 



FIGURE 12.16; A projected ambient isotopy that fails to be a planar isotopy. 

This deformation plays a fundamental role in our use of knot projections 
to study knots. It is what is called a Reidemeister move. We investigate 
Reidemeister moves further in the next section. 

The primary goal of knot theory is to find effective means to distinguish 
between different knot types. In Figure 12.17, we show the collection of knots 
with up to six crossings. Every knot that has a regular projection with six or 
fewer crossings is equivalent to one of these knots or to the mirror image of 
one of these knots. (Note that a knot and its mirror image are not necessarily 
equivalent. We discuss this further in Section 12.4.) 

In the top row in Figure 12.17 the first knot is the trivial knot, the second 
knot is known as the trefoil knot, and the third knot is known as the figure-eight 
knot. 

Each of the knots in Figure 12.17 is known to be distinct. If we made one 
of them out of string, we could never rearrange it to look like any of the others 
without cutting it open and retying it. But how do we rigorously show these 
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FIGURE 12.17: Distinct knots with six or fewer crossings. 


are distinct? We cannot just make one out of string and then try rearranging it 
to look like another. We might fail for six years, but that would not suffice to 
conclude that the knots are distinct. How do we know that another five minutes 
of work would not do the trick? 

Instead, we look for so-called invariants that we associate to a given knot 
and that do not depend on the knot type. There are a variety of different 
invariants that are used in studying knots, some of them numerical values (see 
Section 12.2), some of them polynomials (see Section 12.3), and some of them 
algebraic groups. The important point is that in all these cases, if two knots 
have different values of a particular invariant, we immediately know they are 
distinct knots. 

But how does one tell if a given candidate invariant actually is an invariant? 
It is usually too unwieldy to show that ambient isotopy between knots does not 
change the value of the invariant. There are just too many possible ambient 
isotopies, and even in the simplest examples, the equations become intractable. 

Luckily, we can simplify the task dramatically by using knot projections 
and applying Reidemeister moves to them. We discuss these ideas in the next 
section. 

Exercises for Section 12.1 


12 . 1 . In each case, find an ambient isotopy between the two given embeddings: 

(a) The functions /, g : I — *■ K 2 given by fix) = (x, x) and g(x) = (x 2 , x) 

(b) The functions /, h : / —»• K 2 given by fix) = (x, x) and h(x) -- (0, x) 

12 . 2 . Show that if F is an ambient isotopy between embeddings /, g : Y —► X, then 
/ is homotopic to g as functions that map Y to X. 

12 . 3 . Show that ambient isotopy defines an equivalence relation on the set of embed¬ 
dings of a space Y into a space X. 

12 . 4 . Sketch a series of pictures showing that the first knot in Figure 12.8 can be 
deformed to the second, and that the second knot can be deformed to the third. 
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12.5. Determine which of the knots in Figure 12.18 are equivalent, and conjecture as 
to which knots are not equivalent. (We can prove that two knots are equivalent 
by finding a series of pictures that depict the ambient isotopy from one to the 
other. However, as of yet in this text, we do not have a means of proving that 
two knots are not equivalent.) 

@ QP 

FIGURE 12.18: Which knots are equivalent? 

12.6. Most often, knot theorists think of a knot as embedded in S' 3 rather than R 3 . As 
we have seen, we can think of S 3 as the one-point compactification R 3 U {oo}, 
so there is just a single-point difference between the two spaces. The advantage 
to adding that single point to obtain S 3 is that the resulting ambient space is 
compact, which turns out to be useful. Show that if two knots are equivalent in 
R 3 , they are equivalent in S 3 as well. 



12.2 Reidemeister Moves and Linking Number 

Suppose that two knot projections represent equivalent knots. Then there 
must be an ambient isotopy from the knot represented by the first projection 
to the knot represented by the second projection. However, determining the 
existence or nonexistence of such an isotopy is difficult, since isotopies give us 
so much leeway. In 1927, Kurt Reidemeister (1893-1971) simplified the task 
by showing that the existence of an ambient isotopy between knots represented 
by two different knot projections is equivalent to the existence of a sequence 
of moves, called Reidemeister moves, that take us from one projection to the 
other. The Reidemeister moves fall into three types. 

The Type I Reidemeister moves allow us to put in or take out a kink in a 
projection, as in Figure 12.19. Notice that this certainly does not change the 
knot type. 

_<•> _SL_ «_<=* _Ji_ 

FIGURE 12.19: The Type I Reidemeister moves. 

For the Type II Reidemeister moves, we slide one of two adjacent strands 
under the other, as in Figure 12.20, or, taking the reverse perspective, we slide 
one strand out from under another, resulting in two adjacent strands. The two 
moves illustrated in Figure 12.20 are really the same. If we rotate the illustration 
of the right-hand move by 180 degrees we obtain an illustration that essentially 
shows the left-hand move. It is clear that the knot type is preserved by the Type 
II Reidemeister moves. 





Section 12.2 Reidemeister Moves and Linking Number 387 



FIGURE 12.20: The Type II Reidemeister moves. 


For the Type III Reidemeister moves, a strand is slid past a crossing, over 
the two strands that make up the crossing, as illustrated in Figure 12.21. 

If we examine the moves in Figure 12.21 from the perspective of the bottom 
strand, then in the illustrated moves the bottom strand moves under a crossing 
formed by the other two strands. Thus the Type III moves, as shown, include 
the possibility of moving a strand under a crossing. Furthermore, the illustrated 
Type III moves also include the possibility of moving a strand past a crossing, 
between the two strands that make up the crossing. 



FIGURE 12.21: The Type III Reidemeister moves. 

Next, we introduce Reidemeister’s Theorem, a significant result, which, 
as we will subsequently see, enables us to verify that the knot invariants we 
define are indeed invariant of the knot type. 

THEOREM 12.4. Reidemeister’s Theorem. Two knots are equivalent 
if and only if there is a finite sequence of planar isotopies and Reidemeister 
moves taking a knot projection of one to a knot projection of the other. 

Proof. As mentioned previously, planar isotopies and Reidemeister 
moves do not change knot type, so it is immediate that if two knot projec¬ 
tions are related through a sequence of planar isotopies and Reidemeister 
moves, then they represent equivalent knots. 

Suppose now that two knot projections correspond to equivalent 
knots. We need to show that there is a sequence of planar isotopies 
and Reidemeister moves from one to the other. 

We assume that both of our knots are polygonal, and therefore the 
topological graphs associated to the corresponding knot projections are 
made up of line segments attached at vertices. Since the two knots rep¬ 
resented are equivalent, there is a piecewise-linear ambient isotopy from 
R 3 to R 3 that deforms one knot to the other. Because the isotopy is 
piecewise-linear, by definition we can realize its impact on the knot by a 
sequence of triangle moves. 
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As long as the projection of each stage of the ambient isotopy is a 
knot projection, then we are simply following along a planar isotopy. 
But what happens when a stage of the ambient isotopy does not yield a 
regular projection? It can be shown that there are essentially three ways 
that this can happen. 

Consider the triangle move illustrated in Figure 12.22. At stages 
(c) and (e) the projection is not a regular projection. As the triangle- 
move deformation progresses between stages (b) and (d) through (c), the 
result in the projection is a Type III Reidemeister move. And, as the 
deformation progresses between (d) and (f) through (e), the result is a 
Type II Reidemeister move. 






FIGURE 12.22: The Type III and Type II Reidemeister moves occur during a triangle 
move as illustrated. 

In Figure 12.23 we illustrate the third situation where a projection of 
an ambient isotopy is not a regular projection. With the projection of the 
edges as shown, in stage (c) we do not have a regular projection. The 
overall change in the projection, as the deformation goes from stage (a) 
to stage (e), or vice versa, is a Type I Reidemeister move. 



FIGURE 12.23: The Type I moves occur when the edges in the triangle move are 
arranged as shown. 
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Thus, when the triangle moves are projected in a diagram they result 
in a sequence of planar isotopies or Reidemeister moves. Therefore if 
two knots are equivalent, then we can get from a knot projection of the 
first to a knot projection of the second by a sequence of planar isotopies 
and Reidemeister moves. ■ 


In addition to knots, we also consider links. 


DEFINITION 12.5. A link is an embedding of a set of circles in R 3 . Two 
links are considered equivalent if one can be deformed to the other via ambient 
isotopy. Each embedded circle is called a component of the link and a link is 
called an n-component link if it has n components. 

Figure 12.24 depicts two of the more well-known links, the Whitehead link 
and the Borromean rings. The first is a two-component link and the second is 
a three-component link. The Borromean rings appeared on the family crest of 
the Borromea family during the Italian Renaissance. 




FIGURE 12.24: The Whitehead link and the Borromean rings. 

All of the ideas and results presented for knots (regular projections, Reide¬ 
meister moves, Reidemeister’s Theorem, and so on) carry over here for links. 
There is an easily computed invariant, called the linking number, that we use 
to distinguish links from each other. We introduce it next. 

Let L be a two-component link. On each component, we pick a direction 
of travel around the component. We call the direction an orientation, and we 
denote it by placing arrows on the component. In a particular diagram of the 
link, each crossing appears as one of the two pictures in Figure 12.25. 



FIGURE 12.25: Possible crossings in a projection of an oriented link. 
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Label each crossing that occurs between the two different components 
with a +1 or —1, according to whether the crossing matches (a) or (b) in 
Figure 12.25. We denote the label of a crossing c by 1(c). 

DEFINITION 12.6. The linking number of a diagram of an oriented two- 
component link L is given by 


lk(L) = ] -J2 l ( c ), 

C 

where the sum is taken over all of the crossings c involving both of the compo¬ 
nents in the link. 


EXAMPLE 12.5. We determine the linking number of the diagram of the 
oriented link L that appears in Figure 12.26. 



FIGURE 12.26: The oriented link L with its crossings labeled. 

There are four crossings with label +1 and two crossings with label — 1. 
We do not label the crossings that occur between a component and itself. So, 
lk(L) = j(+l + 1 + 1 + 1 — 1 — 1) = 1. 


Linking number is defined for a link projection associated to a given ori¬ 
ented link. The next theorem shows that linking number is actually well defined 
for the oriented link itself, independent of the choice of a link projection. Given 
that, the theorem implies that linking number is an invariant for oriented links. 

THEOREM 12.7. If two oriented links are equivalent, then all of their dia¬ 
grams have the same linking number. 

Proof. We must show that if we have two link projections of equivalent 
oriented links, then the linking number computed from each one is the 
same. As with knot projections, if two links are equivalent, then their 
link projections are related through a sequence of planar isotopies and 
Reidemeister moves. The planar isotopies do not change the crossings 
and hence do not affect the linking number. Thus, it is enough to show 
that the linking number is unaffected by Reidemeister moves. 
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Suppose we do a Type I Reidemeister move. It either eliminates or 
creates a crossing between a single component and itself. However, it 
does not change the crossings between the components. So the linking 
number is unaffected by a Type I move. 

Suppose we slide a strand under an adjacent strand via a Type II 
Reidemeister move. If the two strands are from the same component, this 
does not change the linking number. If the two strands are from different 
components, we create a +1 crossing and a —1 crossing. Since in the 
linking number, the +ls and —Is are summed, this leaves the linking 
number unaffected. Similarly, if we eliminate two crossings between 
distinct components via a Type II move, we eliminate a +1 crossing and 
a — 1 crossing, and therefore the linking number remains unchanged. 

Finally, if we do a Type III Reidemeister move, the linking number 
is unchanged. We simply move the positions of the labeled crossings, so 
we do not change the overall sum of the labels. 

Thus, the Reidemeister moves leave the linking number unchanged, 
and therefore all diagrams of equivalent oriented links have the same 
linking number. The linking number is an invariant of oriented links. 


EXAMPLE 12.6. Consider the links in Figure 12.27. The link T is called 
the trivial link of two components. It has linking number 0. The link in 
the middle, M, has linking number +2 or —2 depending on the choice of 
orientations. Thus, link M is not trivial. 

The linking number of the Whitehead link, W, is also 0. Hence, linking 
number does not distinguish W from the trivial link, but it does distinguish W 
from M. 



FIGURE 12.27: The linking number distinguishes T and W fromM, but not T from W. 

_i 


Exercises for Section 12.2 

12.7. Determine a sequence of Reidemeister moves that would take us from the first 
projection of the knot depicted in Figure 12.28 to the second projection. 
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FIGURE 12.28: How does the first projection transform to the second? 


12.8. Use linking number to distinguish between the two links shown in Figure 12.29. 



FIGURE 12.29: Show that these links are not equivalent. 


12.9. Use linking number to show that the link appearing in Figure 12.30 is not the 
trivial link of two components. 



FIGURE 12.30: Show that this link is not trivial. 


12.10. Use linking number to prove that the two links shown in Figure 4.20 are not 
equivalent, and then use that result to argue that there is no ambient isotopy 
between the two embeddings of the annulus illustrated in Figure 4.19. 

12.11. Prove that the two three-component links in Figure 12.31 are not equivalent. 




FIGURE 12.31: Show that these two links are not equivalent. 
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12.3 Polynomials of Knots 

In his 1928 paper, “Topological invariants of knots and links,” J. W. Alexander 
(1888-1971) introduced a polynomial invariant for knots. This invariant made 
it possible to distinguish between knots by showing that they have different 
associated polynomials. However, the Alexander polynomial is limited in the 
distinctions it allows—there are different knots that have the same Alexander 
polynomial. In the 1980s, Vaughan Jones discovered a new polynomial invari¬ 
ant, now known as the Jones polynomial. This resulted in a dramatic increase 
in the number of knots that could be distinguished. Since the introduction of 
the Jones polynomial, a variety of new polynomial invariants have been dis¬ 
covered. Here, we introduce the Jones polynomial using the approach of Louis 
Kauffman. 

Suppose that we want to create a polynomial invariant for knots and links. 
We assume that it is a Laurent polynomial, meaning the variable can appear in 
the polynomial with either positive or negative integer exponents. We would 
like to be able to compute the polynomial from a given knot projection. How¬ 
ever, for it to be an invariant, it cannot depend on the particular projection. We 
need to know that it is unaffected by changes in the knot projection. 

In the previous section, we saw that any two distinct projections of a given 
knot are related by a sequence of Reidemeister moves. So if we can create a 
polynomial from a knot projection such that it is unaffected by Reidemeister 
moves, it will be the same for all knot projections. We will have created a 
polynomial invariant for knots. 

First, let us assume that we are creating a polynomial in three variables 
A, B, and C that can be calculated from a knot or link projection. Initially, 
we will not assume that it is the same for all projections of a given knot. 
Ultimately, we will reduce the polynomial to a polynomial in one variable A as 
we work toward defining a polynomial invariant from it. We call the resulting 
polynomial a bracket polynomial, and we denote the bracket polynomial of 
a given knot or link projection P by < P > . For simplicity, in the remainder of 
the section we say “projection” where we mean “knot or link projection.” 

We assume that three rules are satisfied by the polynomial. 

Rule 1: < 0 > = 1 where O is a trivial-knot projection with no crossings. 

This says that a knot projection with no crossings has polynomial 1. This is 
what is called a normalization condition. 

Rule 2: <P uQ> = C<P>. 

This rule indicates that adding an extra, unlinked trivial component O to a 
projection P multiplies the polynomial of P by C. 

Rule 3: <X> = A<)(> + S<X!>> 

<X> = a<X> + #<)(>. 

Rule 3 says that we can obtain the polynomial associated to a projection P 
in terms of the polynomials associated to two new projections, each with one 
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fewer crossing than P . This is called a skein relation. There are two versions 
of this relation, but they are essentially the same. 

The first version of Rule 3 indicates that given a projection, if we take a 
crossing in it that is arranged as shown and we cut through the crossing vertically 
to obtain a new projection and we cut through the crossing horizontally to obtain 
another new projection, then the bracket polynomials of the three projections 
are related as in the equation. The second version of Rule 3 can be interpreted 
similarly, but note that if we rotate each diagram in the second version by 90 
degrees, we obtain the first version, and therefore, as already indicated, the 
two versions of the relation really are the same. How the skein relation works 
should become clear as we employ it in a number of situations hereafter. 

Given these rules, we want to make choices for B and C so that the re¬ 
sultant polynomial is an invariant. In particular, we want it to be unaffected 
by the Reidemeister moves. Let us start with the Type II Reidemeister move. 
Applying the bracket polynomial rules, we obtain the following: 

<(f> = A<^> + B<(\> 

= A[A<X> + R<^>] + R[A<)(> + Z?<X>] 

= AA<X> + ABC<X> + #A<)(> + RZ?<X> 

= [A 2 + ABC + B 2 ]<X> + BA<)(>. 

To have the bracket polynomial unaffected by Type II Reidemeister moves, we 
want < = <)(>■ From the relationship we just derived, it follows that we 

need to have BA — 1 and A 2 + ABC + B 2 = 0. Therefore we set B = A -1 
and C = — A 2 — A~ 2 . The original three rules then become 

Rule 1: <0> = 1- 

Rule 2: <PuO> = M 2 -A‘ 2 ]</’>. 

Rule 3: <X> = A<) (> + A -1 <X>. 

<X> = A<X> + A -I <)(>. 

With these new rules, we know that the resulting polynomial will be unchanged 
by Type II Reidemeister moves. 

Next, we look at the bracket polynomials of a few specific projections. 
Following that, we return to examining how the bracket polynomial is affected 
by Reidemeister moves. 


- ] 

EXAMPLE 12.7. The following bracket polynomials are not difficult to verify 
using Rules 1-3. We ask you to do so in Exercise 12.12. 


< 00 > = A 2 - A- 2 

<CO> = < oo > = -^ 3 

<@> = -A- 3 <@> = -A 3 


J 
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EXAMPLE 12.8. Here we compute the bracket polynomial of the projection 
of the trefoil knot. Rule 3 is used to obtain the first two equalities shown, 
and then Rule 2 and a little algebra are used to obtain the third equality: 

= A<^-P> + A -1 

= A[A<® C ^ ) > + A~*<^-P>] + A _1 [A< ( ^Q ) > + A _I < < \Q^>] 

= A 2 [—A 2 - A- 2 ]<QO> + 2<Q0> + A~ 2 <@>. 

Now, if we substitute in values from the bracket polynomials in Example 12.7 
and then simplify, we obtain 

<<§>> = A 2 [-A 2 - A- 2 ](-A 3 ) + 2(—A 3 ) + (A _2 )(-A -3 ) 

= A 7 - A 3 - A“ 5 . 

Therefore the bracket polynomial of the projection is A 7 — A 3 — A" 5 . 


We now return to our investigation of the impact of Reidemeister moves on 
the bracket polynomial. What happens under Type III Reidemeister moves? 
For the first form of the Type III move, we obtain the following: 

<K> = a<;k > +a- i <:k > 

= A<)([> + A -1 <>|> 

= <>[>• 

The first and third equalities hold by the skein relation, and the second equality 
holds by the fact that the bracket polynomial is unchanged by Type II moves. 
Conveniently enough, the bracket polynomial is unaffected by a Type III move 
of the first form, given that it is unaffected by a Type II move with our choice of 
B and C. It can similarly be shown that the bracket polynomial is unaffected 
by a Type III move of the second form. 

Finally, we must consider the impact of Type I Reidemeister moves. Ap¬ 
plying the skein relation and our rules, we obtain the following: 

<JL> = A<_A_> + A _I <_2_> 

— A<—> + A _1 [—A 2 — A~ 2 ]< —> 

= —A _3 <—>. 

Moves of Type I have an impact on the polynomial, creating a serious problem. 
We do not obtain the needed equality, <JL> = <—>, for the bracket 
polynomial to be invariant under the first form of the Type I Reidemeister move. 
A similar situation occurs with the second form of the Type I Reidemeister 
move. 

To deal with this issue, we need one other concept, called the writhe. 
Using it, we will be able to define a knot and link invariant from the bracket 
polynomial. The writhe is defined like linking number, only we consider every 
crossing in a projection, not just those involving different components of a link. 
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DEFINITION 12.8. Given a projection P of an oriented link, the writhe of 
P, denoted w(P ) is the sum of the labels, +1 or — 1, at all of the crossings 
in P. 


EXAMPLE 12.9. The writhe of the first projection in Figure 12.32 is 0, while 
the writhe of the second one is 7. 



FIGURE 12.32: Computing the writhe of the given projections. 


Notice that when we perform a Type I Reidemeister move on a projection, 
as in Figure 12.33, the writhe either decreases by 1 or increases by 1. 



FIGURE 12 33: A Type I move decreases or increases the writhe by 1. 

At this point we have defined the bracket polynomial for projections. It 
is unchanged by Type II and Type III Reidemeister moves, but it changes 
under Type I moves. Therefore the bracket polynomial is not an invariant for 
knots and links since it depends on the particular projection. We have also 
defined the writhe, a property of projections of oriented links. We use the 
writhe with the bracket polynomial to obtain a new polynomial defined for 
projections of oriented links. As we show in Theorem 12.10, this polynomial 
does not depend on the particular projection of a knot or link and therefore is 
a polynomial invariant. 

DEFINITION 12.9. The Kauffman X polynomial of a projection P of an 
oriented link is defined to be the polynomial X(P) = (—A 3 )~ w< - P ^ <P>. 

THEOREM 12.10. The Kauffman X polynomial is an invariant for knots and 
for oriented links. 

This theorem indicates that the X polynomial is an invariant for knots, 
whether they are oriented or not, and for links that are oriented. The reason 
for this difference is that the writhe of a projection of an oriented knot is not 
changed if the orientation of the knot is changed, but the writhe of an oriented 
link can change if the orientation on one of the link components is changed. 
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Proof. To prove the theorem we need to show that the X polynomial is 
the same for all projections of a given knot or oriented link. Therefore 
we need to show that it is unaffected by the Reidemeister moves. 

First consider the Type II Reidemeister move. We have already seen 
that the bracket polynomial is unaffected by Type II moves. The writhe 
is also unchanged by Type II moves because such moves either add two 
crossings of opposite sign or eliminate two crossings of opposite sign. 
Thus the X polynomial is unaffected by Type II Reidemeister moves. 

Next, consider the Type III Reidemeister moves. Here too, we have 
already seen that the bracket polynomial is not affected by such moves. 
Furthermore, a Type III move just rearranges crossings, without changing 
the labels on any of them. Therefore the writhe is unchanged by Type 
III moves. It follows that the X polynomial is unaffected by Type III 
Reidemeister moves. 

Finally, when we perform a Type I Reidemeister move, putting a kink 
in our projection as on the left in Figure 12.33, we see that the writhe 
goes down by one. This results in the polynomial being multiplied by 
—A 3 . However, the bracket polynomial is multiplied by a factor of—A -3 
by that same move, as seen in the relationship < JL> = — A~ 3 <—> 
previously derived. Therefore the X polynomial is unaffected by the first 
form of a Type I Reidemeister move. We leave it to you to check that the 
X polynomial is not affected by the other form of a Type I Reidemeister 
move. 

It follows that the Kauffman X polynomial is an invariant for knots 
and oriented links. ■ 


EXAMPLE 12.10. We calculate the Kauffman X polynomial for the trefoil 
knot From Example 12.8 we have the bracket polynomial 

<&> = A 7 -A 3 - A -5 . 

The writhe of the projection is —3. Therefore the Kauffman X polynomial 
of the trefoil knot is 

X((£P) = (-A 3 ) 3 (A 7 - A 3 - A -5 ) = -A 16 + A 12 + A 4 . 


As a result of Example 12.10, we can now prove that there are nontrivial 
knots. In particular, we have the following theorem: 

THEOREM12.il. The trefoil knot is not equivalent to the trivial knot. 

Proof. The Kauffman X polynomial of the trivial knot is 1 and therefore 
is not equal to the Kauffman X polynomial of the trefoil knot. Since 
the Kauffman X polynomial is an invariant for knots, it follows that the 
trefoil knot is not equivalent to the trivial knot. ■ 
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We have shown that there are nontrivial knots. Not only have we done 
that, but we also have a means for distinguishing between many knots. In fact, 
with the Kauffman X polynomial, it is possible to distinguish between all of 
the knots with six or fewer crossings shown in Figure 12.17. 

The Kauffman X polynomial is equivalent to the Jones polynomial, which 
was first discovered in 1984. The Jones polynomial is defined using a slightly 
different set of rules than the three we used in defining the bracket polynomial; 
the result is a polynomial with fractional exponents. The Jones polynomial can 
be obtained from the Kauffman X polynomial, however, by substituting t -1 / 4 
for A. 


Exercises for Section 12.3 

12.12. Compute the bracket polynomials of each of the projections in Example 12.7, 
verifying the polynomials presented there. 

12.13. (a) Compute the bracket polynomial of the trivial link of n components shown 

on the left in Figure 12.34. 

(b) Compute the bracket polynomial of the twisted trivial-knot projections 
shown on the right in Figure 12.34. 


ooo - o 

n components 


ooooo 

oocoo 


n crossings 

FIGURE 12.34: The trivial link of n components and the twisted trivial-knot projec¬ 
tions. 

12.14. Compute the writhe of the projections shown in Figure 12.35. 




FIGURE 12.35: Compute the writhe of each projection. 


12.15. Compute the X polynomial of the mirror image of the trefoil knot whose 

X polynomial we computed in Example 12.10. What does your result imply 

about the knots and Ppp? 

12.16. Compute the X polynomial of the figure-eight knot (the third knot in the top 
row in Figure 12.17). 
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12.17. Compute the X polynomials of the trivial link of two components and each 
of the oriented Hopf links shown in Figure 12.36. Conclude that these three 
oriented links are distinct. 



FIGURE 12.36: Oriented Hopf links. 


12.18. The original Jones polynomial V (L) of a knot or link L is obtained from the X 
polynomial by the substitution A = r -1 / 4 . 

(a) Find the Jones polynomial of the figure-eight knot. 

(b) Use the skein relation to show that the Jones polynomial satisfies the skein 
relation t~ l V(L+) - tV(L-) + ( r' ! 2 - t ] ' 2 )V(L 0 ) = 0, where L+, 
L_, and Lq are three projections that are identical except for the portions 
appearing in Figure 12.37. 



FIGURE 12.37: The projections L + , L-, and Lq differ in only one place, as shown 
here. 


12.4 Applications to Biochemistry and Chemistry 

In this section, we consider applications of knot theory to enzyme actions on 
DNA and to chemical design. 

Knots in DNA 

We introduced the structure of DNA in Section 5.2, where we indicated that 
a DNA molecule is made up of two chains of nucleotides that bond together. 
The two chains are tightly wrapped around one another, not unlike two lengths 
of twine that have been braided into a thicker rope. Despite this higher-order 
structure, a DNA molecule—like a rope—can be modeled as a single long, thin 
strand. In addition, we call a DNA molecule cyclic if one end of the molecule 
is glued to the other. Cyclic DNA is common, but, much of what we say in 
this section holds even for noncyclic DNA. 

As we stated in the introduction to the book, the level of DNA compaction 
in the nucleus of a cell is equivalent to stuffing 200 kilometers of fishing line into 
a basketball. Problems relating to DNA tangling are therefore quite serious for 
biological systems. As an example, consider the process of DNA replication. 
The two nucleotide chains from one DNA molecule separate, and each is used 
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as a template for construction of a complete, new molecule. Because the 
templates are heavily intertwined at the start, the two new molecules are bom 
tangled together. For them to be useful, the cell must be able to separate them. 
How does this occur? 

Inside the nucleus of the cell, there are enzymes that allow the DNA to 
disentangle and tangle. Biochemists would like to understand the action of 
each of these enzymes. One such enzyme, topoisomerase II, takes two strands 
of DNA, cuts one open, passes the other through, and then closes up the first. 
This amounts to a crossing change, as shown in Figure 12.38. 



FIGURE 12.38: An enzyme making a crossing change. 

We would like to know whether such moves are sufficient to untangle DNA 
molecules. The following theorem answers that question in the affirmative: 

THEOREM 12.12. Given a knot projection, there is a subset of the crossings 
that can be switched to obtain a diagram of the trivial knot. 

Proof. Given a knot projection P, we define an algorithm for switching 
crossings so that the resulting knot projection P' corresponds to the trivial 
knot. 

The Unknotting Algorithm: On P, choose a noncrossing point as a 
starting point, and choose a direction of travel. (See Figure 12.39.) As we 
travel around P from the starting point, each time we arrive at a crossing 
for the first time, change it to an overcrossing relative to our position if 
it is not already an overcrossing. (When we arrive at a crossing for the 
second time, it will be an undercrossing relative to our position, and we 
leave it as is.) Continue this process as we travel all of the way around 
P, back to the beginning. 

The example in Figure 12.39 shows how the Unknotting Algorithm 
changes crossings (the ones that are shaded) to produce the trivial knot. 

Given a knot projection P, let P' be a knot projection obtained by 
applying the unknotting algorithm to P. We claim that P' is a projection 
of the trivial knot. To establish this, we construct a knot K in M 3 having 
knot projection P', then show that K is trivial. 

Assume that the knot projection P' lies in the ry-plane. We construct 
K so that it projects directly down onto P'. Therefore, above each point 
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( x , y ) in P', we choose a point (x, y, z ) on the knot in 3-space. The 
points on the knot are chosen so that as we travel around P' in the given 
direction from the starting point, the ^-coordinate decreases continuously 
until just after our last encounter with a crossing, then increases so that 
we make our way back up to the first point we chose on the knot. (See 
Figure 12.40.) 



FIGURE 12.40: Building a knot from a knot projection. 

Clearly, if we project K directly down to the xy-plane, it projects 
onto P'. Furthermore, since the ^-coordinates are chosen as they are, we 
have the proper overcrossing/undercrossing relationship at each crossing 
in the diagram. Finally, with the knot K as constructed, it is not difficult 
to see that the decreasing-z part of K and the increasing-z part of K can 
be deformed to separate halves of a circle making up a trivial knot in R 3 . 
(See Figure 12.41.) It follows that K is trivial, and P' is a knot projection 
ofK. ■ 
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FIGURE 12.41: The knot K deforms to a trivial knot. 

As a consequence of Theorem 12.12, every knot has a minimum number 
of crossing changes needed to unknot it. 

DEFINITION 12.13. The unknotting number of a knot K is the minimum 
number of crossings, over all knot projections of K, that must be changed in 
order to turn some knot projection of K into one corresponding to the trivial 
knot. 


-i 

EXAMPLE 12.11 . The figure-eight knot shown on the left in Figure 12.42 is 
nontrivial. (See Exercise 12.16.) However, one crossing change turns it into 
the trivial knot. Therefore its unknotting number is 1. 



FIGURE 12.42: The figure-eight knot can be made trivial with one crossing change. 


EXAMPLE 12.12. The 7 4 knot shown on the left in Figure 12.43 can be 
unknotted with the two crossing changes shown on the right. However, proving 
that its unknotting number is 2 requires showing that it does not possess a 
diagram that, with one crossing change, can be made into a diagram of the 
trivial knot. Although this can be shown, we do not have the tools necessary 
to prove it here. 




FIGURE 12.43: Unknotting the 74 knot. 


Besides the topoisomerase II enzyme, there are other enzymes that also 
perform sophisticated actions on DNA strands. As these enzymes are isolated, 
knot-theoretic properties can be used to categorize their function. 
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Once an enzyme is isolated in the laboratory, it is applied to a collection 
of unknotted cyclic DNA molecules. The various knots that result from its 
application are then determined, and the action of the enzyme is deduced from 
the knots. But how are the resulting knots determined? They are so tiny that, 
although electron microscopes can sometimes be employed to determine the 
knotting of an individual DNA molecule, there are too many molecules present 
to use this method effectively. 

Fortunately, biologists have developed numerous methods for separating 
mixtures of molecules on the basis of their chemical and physical properties. 
For example, if a mixture of variably sized molecules is passed through a porous 
material, it is not hard to imagine that the smaller molecules will wiggle their 
way through first. The degree of knottedness in a molecule also influences 
its ability to traverse such a material. In one such process, called gel elec¬ 
trophoresis, it has been experimentally demonstrated that for small numbers of 
crossings the speed with which a knot passes through a porous material is pro¬ 
portional to the number of crossings. This technique is used to divide the knot 
mixture into its constituent knot types. Each type of knot is then imaged using 
electron microscopy, providing insight into the function of the novel enzyme. 

The chemotherapy drug doxorubicin specifically prevents enzymes from 
unknotting DNA, thereby inhibiting the ability of DNA to replicate (a prereq¬ 
uisite for cell division). Since cancer cells divide very quickly, they are partic¬ 
ularly vulnerable to this effect, making the drug an effective cancer-fighter. 

Knots in Synthetic Chemistry 

In addition to cyclic DNA molecules, much simpler molecules can also in¬ 
clude cycles of bonded atoms. A well-known example of a cyclic molecule is 
benzene, consisting of six carbon and six hydrogen atoms bonded as shown in 
Figure 12.44. 


H H 

C=C 

4 x o 

Vcf 

H H 


FIGURE 12.44: A diagram of the benzene molecule. 

There are many other common cyclic molecules employed in chemistry, 
and, like benzene, they appear in an unknotted form. However, given a cyclic 
molecule in an unknotted form, an obvious question is whether or not we can 
create a knotted version of the molecule as well. In addition, would the knotted 
version of the molecule have different chemical and physical properties from 
the unknotted version? 

It was relatively recently that chemists first synthesized a knotted molecule. 
A challenge in so doing was to create a knotted molecule with relatively few 
atoms (in contrast to DNA, which has millions of atoms and readily forms 
knots). In 1988, Christina Dietrich-Buchecker and Jean-Pierre Sauvage of 
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the University of Stausborg became the first chemists to synthesize a knotted 
molecule. Their process required some creative chemistry, using two copper 
ions to hold two molecular strands in a particular shape so that when the strands 
were bonded and the copper ions removed, the resulting molecule was knotted. 
The molecule consists of 104 carbon, 104 hydrogen, 8 nitrogen, and 14 oxygen 
atoms. A diagram of the molecule is shown in Figure 12.45; each unlabeled 
vertex in the diagram represents a carbon atom bonded to either 0, 1, or 2 
hydrogen atoms so that each carbon atom has 4 bonds around it (counting each 
doubled edge as two bonds). 



FIGURE 12.45: A diagram of the first synthesized knotted molecule. 

Since the first synthesis of a knotted molecule, there has been continued 
interest in the development and study of techniques for their formation. Once 
chemists discover simple methods for synthesizing knotted molecules, a new 
world of chemical products will likely result. Each distinct knot could coin¬ 
cide with a distinct molecule generated by the same set of constituent atoms 
bonded in the same cyclic order. Given a particular cyclic molecule, the only 
bound on the number of different knotted molecules that can be made from 
it is the flexibility of the molecule. The process of distinguishing between 
knotted molecules and determining their properties will certainly benefit from 
our understanding of knots and their properties. 

The property known as chirality is of significant interest in chemistry, even 
for molecules that are neither knotted nor cyclic. A molecule that cannot 
be superimposed on its mirror image is called chiral; otherwise, it is called 
achiral. The drug thalidomide is an example of a chiral molecule. Diagrams 
of (R)-thalidomide and (S)-thalidomide are shown in Figure 12.46. These 


C 8 H 4 0 2 N 




(R)-Thalidomide 


(S)-Thalidomide 


FIGURE 12.46: The molecules (R)-thalidomide and (S)-thalidomide. 
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two molecules are mirror images of each other by a reflection in the plane of 
the page. In (R)-thalidomide, on the left in the figure, the solid tapered bond 
indicates that the H atom is in front of the plane of the page, while the dashed 
tapered bond indicates that the C 8 H 4 O 2 N group is behind the plane. The 
situation with (S)-thalidomide is the reverse. If we tried to superimpose (R)- 
thalidomide onto (S)-thalidomide by rotating the former so that the H atoms 
and the C 8 H 4 O 2 N groups in each coincide, then the remaining parts of the two 
molecules would not align properly. Thus, thalidomide is a chiral molecule. 

Chemists have long recognized the importance of chirality. A chiral 
molecule and its mirror image have similar physical properties and similar 
interactions with achiral molecules, but they can have dramatically different 
reactions with other chiral molecules. Thalidomide is a classic example. It 
was prescribed to pregnant women in the 1950s and 1960s to alleviate morning 
sickness. While (R)-thalidomide is effective in that role, the drug was soon 
linked to a wave of birth defects. It is now known that (S)-thalidomide, also 
present in doses of the drug, is responsible for these unfortunate effects. The 
thalidomide case motivated interest in both the biological chemistry of chiral 
compounds and the adoption of more stringent drug-safety protocols. 

If we take knotted molecules into consideration, then there are important 
chirality questions that arise just from a knot-theoretic perspective. For in¬ 
stance, if a molecule is represented by a knot that is not equivalent to its mirror 
image, then the molecule is automatically chiral. Therefore it is of interest to 
investigate which knots are equivalent to their mirror image. 

The mirror image of a knot K is the knot obtained by taking the reflection 
of AT in a plane in R 3 . It is straightforward to show that the knot type of the 
reflection is independent of the plane chosen. If we have a diagram of K, and 
reflect K in the plane of that diagram, then it follows that the diagram of K 
can be converted into a diagram of the resulting mirror image by switching all 
of the crossings. (See Figure 12.47.) 




FIGURE 12.47: Diagrams of a knot and its mirror image. 


In Figure 12.10 we show a deformation that starts at the figure-eight knot. 
If you rotate the final stage of the illustration by 180 degrees and deform the 
knot a little further, it is easy to see that you end up with the mirror image 
of the knot at the start of the deformation. Therefore the figure-eight knot is 
equivalent to its mirror image. On the other hand, the trefoil knot and its 

mirror image ^9 have distinct Kauffman X polynomials and therefore are not 
equivalent to each other. (See Example 12.10 and Exercise 12.15.) 
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DEFINITION 12.14. A knot that is equivalent to its mirror image is said to' 
be amphichiral. Otherwise, the knot is said to be chiral. 

We can use the Kauffman X polynomial to help determine if a knot is 
chiral. To begin, we have the following theorem that relates the Kauffman X 
polynomial of a knot to the polynomial of its mirror image: 


THEOREM 12.15. Let K be a knot with Kauffman X polynomial P. If K* 
is the mirror image of K, then the Kauffman X polynomial of K* is obtained 
by substituting A ~ 1 for A in P. 

Proof. See Exercise 12.23. ■ 

From Theorem 12.15 it follows that if the Kauffman X polynomial of a 
knot changes under the substitution of A -1 for A, then the knot and its mirror 
image are not equivalent, and therefore the knot is chiral. Polynomials that do 
not change under a substitution of A -1 for A are palindromic, a property of 
polynomials that is defined as follows: 


DEFINITION 12.16. A polynomial 

a- n A ”+<z—«+iA n+1 + ... + a n -i A” *+a n A” 

is called palindromic ifa-j = aj for all j — 1,... ,n. 

Thus a polynomial is palindromic if its coefficients read the same forwards 
as backwards. It is straightforward to see that a polynomial 

a- n A " + a- n +iA + ... + a n —\A n 1 -\-a n A n 

is palindromic if and only if it is unchanged under the substitution of A -1 for A. 
We now obtain the following corollary of Theorem 12.15. 


COROLLARY 12.17. If the Kauffman X polynomial of a knot K is not palin¬ 
dromic, then K is chiral. 

Corollary 12.17 yields a simple test for the chirality of knots and knotted 
molecules. In this way, it is possible to use basic ideas from topology and knot 
theory to distinguish between molecules. 

Building chemical compounds and distinguishing between them are impor¬ 
tant aspects of synthetic chemistry. Determining the properties of the resulting 
compounds is also significant. In Section 13.2, we will show how properties 
of topological graphs can be used to help predict properties of molecules. 

Exercises for Section 12.4 

12.19. Describe an algorithm for changing crossings in a link projection of an n- 
component link so that the result is a link projection of the trivial link with n 
components. 
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12.20. Determine the unknotting number of the knots appearing in Figure 12.48. 



FIGURE 12.48: Determine the unknotting number. 

12.21. Show that the unknotting number of each knot in Figure 12.49 is at most 2. 



FIGURE 12.49: Show that the unknotting number of each is at most 2. 

12.22. Sketch the knot associated with the knotted molecule in Figure 12.45. Then 
determine the knot in Figure 12.17 that is equivalent to this one by sketching a 
deformation between the two. 

12.23. Prove Theorem 12.15: Let K be a knot with Kauffman X polynomial P. If/C* 
is the mirror image of K, then the Kauffman X polynomial of K* is obtained 
by substituting A -1 for A in P. (Hint: First prove the result for the bracket 
polynomial, then show that it holds for the Kauffman X polynomial.) 

12.24. Prove that the knot 52, which is the last knot in the top row in Figure 12.17, is 
chiral. 
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In the introduction to this book, we indicated that Euler’s solution to the 
Konigsberg bridges problem can be considered the moment of birth of the 
field of topology. It can equally well be considered the starting point for the 
field of graph theory. Naturally there is overlap between the two fields, and that 
overlap is a subdiscipline of each, known appropriately as topological graph 
theory. 

We consider graphs and some of their properties in the first section of this 
chapter. In Section 13.2, we present an application to the quantitative structure- 
property relationship in chemistry, where structural aspects of molecules (for 
example, graph models) are used to predict properties of the molecule. We 
address important questions and results about embeddings of graphs in Sec¬ 
tion 13.3. In the last section, we introduce the concepts of crossing number 
and graph thickness, and we discuss an application in electronic circuit design. 

13.1 Graphs 

In this section, we introduce some basic properties and results involving graphs. 
Along the way, we develop appropriate concepts to solve the Konigsberg 
bridges problem via graphs. 

In graph theory, a graph G is abstractly defined as a finite set Vc of vertices 
along with a finite set Eg of unordered pairs {u,, vj} of vertices. The elements 
of Eg are called the edges of G. 

In topological graph theory, a graph is regarded as a topological space G 
made up of a finite set of points Vc, called the vertices of G, along with a 
finite collection Eg of edges joining vertices. Each edge results from taking a 
closed bounded interval in R and gluing one endpoint to a vertex v, and gluing 
the other endpoint to a vertex Vj. (See Example 3.17.) 

There is a straightforward correspondence between the abstract graphs in 
graph theory and the topological graphs in topological graph theory. Through¬ 
out the chapter we do not distinguish between abstract graphs and topological 
graphs. We use the term graph to refer to both. In cases where the difference 
matters, it should be clear from the context which meaning is intended. 

Let G be a graph. If e is an edge of G with vertices and vj, then e and 
the vertices and vj are said to be incident to each other. Also, e is said to 
join the vertices Vj and Vj. 


IMPORTANT NOTE: In a few places in this chapter we construct graphs by 
removing vertices or edges from a particular graph. In such cases, when we 
remove an edge from a graph, we are not necessarily removing the vertices 
incident to the edge unless we explicitly state that we are. 


408 
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A graph is a compact Hausdorff space. (See Exercise 13.1.) Therefore 
each one-point set in a graph is closed. In particular, each vertex is closed 
as a one-point set. Each edge in a graph is a compact subspace of the graph 
since it is the image of a compact space under a continuous function (the 
quotient mapping that glues the edges and vertices together). Since graphs are 
Hausdorff, it follows that each edge is a closed subset in the graph. 


EXAMPLE 13.1. Two important families of graphs are the complete graphs 
and the complete bipartite graphs. The complete graph on n vertices is the 
graph K n that has n vertices and a collection of edges such that each pair of 
distinct vertices is joined by a single edge. (See Figure 13.1.) 

The complete graph on n vertices models the handshake problem, which 
asks, “If n people shake each others’ hands, how many handshakes result?” 


Ko 





K s 


FIGURE 13.1: The complete graphs Ki, Kj, K 4 . and K 5 . 


The complete bipartite graph K mi „ is a graph having m+n vertices that 
can be divided into sets V m and V n of m and n vertices, respectively, such that 

(i) Each edge joins a vertex in V m to a vertex in V„, 

(ii) Each pair of vertices v e V m and v' e V n is joined by a single edge. 
Examples of complete bipartite graphs are shown in Figure 13.2. 




FIGURE 13.2: The complete bipartite graphs Ar ] 5 , # 2 , 3 , and £3 3. 

The graph ^ 3,3 models a popular pencil-and-paper puzzle, known as the 
three-utilities problem, which asks if it is possible (on paper) to connect three 
houses to three utilities (gas, water, and electricity) without having the lines be¬ 
tween the houses and the utilities cross. This amounts to asking if it is possible 
to embed ^3 3 in the plane. We discuss this problem further in Section 13.3. 
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DEFINITION 13.1. Let G be a graph. For each vertex v e G, define the 
degree of v to be the number of edges incident to v, counting an edge twice if 
it joins v to itself. 

We can think of the degree of a vertex v as the number of different ap¬ 
proaches to v along edges incident to v. 


EXAMPLE 13.2. In Figure 13.3 we display graphs with the degree of each 
vertex labeled. 



FIGURE 13.3: Degrees of vertices. 


EXAMPLE 13.3. In the complete graph on n vertices, K n , each vertex has 
degree n — 1. In the complete bipartite graph K m<n , there are m vertices of 
degree n that are joined by the graph’s edges to n vertices of degree m. 


For graphs, equivalence is defined as follows: 

DEFINITION 13.2. Graphs G and G' are isomorphic or graph equivalent 
if there is a bijective function f : Vq —> Vq' between the corresponding sets 
of vertices, such that for every v, w e Vg, the number of edges in G that join 
v and w is equal to the number of edges in G' that join f (u) and f (w). Such 
a function f is called a graph isomorphism. 

Given a graph isomorphism /, it is straightforward to see that / induces a 
homeomorphism /* mapping graph G to graph G' and mapping the vertices 
of G bijectively to the vertices of G'. The converse to this situation also holds. 
Specifically, we have the following theorem: 

THEOREM 13.3. Let G and G' be graphs with vertex sets Vq and Vq>, 
respectively. If there is a homeomorphism h : G —> G' that maps Vq bijectively 
to Vq’, then G and G' are isomorphic and the function h y : Vg —> Vq’, defined 
by hy{v) = h(v), is a graph isomorphism. 


Proof. See Exercise 13.3. 
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If we drop the assumption in Theorem 13.3 that h maps the vertex set of G 
bijectively to the vertex set of G', then it does not necessarily follow that G and 
G' are isomorphic. For example, the graphs in Figure 13.4 are homeomorphic, 
but not isomorphic. 



FIGURE 13.4: Graphs G and G' are homeomorphic, but not isomorphic. 

Summarizing the preceding definition, theorem, and example, we have two 
notions of equivalence for graphs G and G 

(i) Topological Equivalence: Graphs G and G' are homeomorphic. 

(ii) Graph Equivalence: Graphs G and G' are homeomorphic by a 
homeomorphism that maps the vertex set of G bijectively to the 
vertex set of G'. 

In general, we allow graphs to have parallel edges (two or more edges that 
join a pair of vertices) and loops (edges that join a vertex to itself). But we are 
also interested in graphs where neither of these occur: 

DEFINITION 13.4. A graph is said to be simple if it has no parallel edges 
and no loops. 

In Figure 13.5, graphs G\ and G 2 are simple, but graphs G 3 and G 4 are 

not. 






FIGURE 13.5: The first two graphs are simple, but the second two are not. 

THEOREM 13.5. If G is a graph, then there is a simple graph G' that is 
homeomorphic to G. 

Proof. Let G be a graph. Create a new graph G' as follows. For each 
edge e in G, joining vertices v and v\ add two new distinct vertices, 
ui and t> 2 , and replace e with three new distinct edges, e\ , e 2 , and e-$, 
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such that e\ joins v and v\, ei joins v\ and vi, and ei joins V 2 and v'. 
(See Figure 13.6.) The resulting graph G' is homeomorphic to G and is 
simple. B 




FIGURE 13.6: Adding edges and vertices to make a simple graph. 

A walk is the graph-theory equivalent of a path. It is defined as follows: 

DEFINITION 13.6. Let G be a graph and v and v' be vertices of G. A walk 
from v to v' is an alternating sequence of vertices and edges, 

v = vo, eo, vi, e\,... , v n -\,e n -\, v„ = v\ 

such that edge e t joins vertices v/ and u,+i for each i = 0. n — 1. 

The following definition introduces some different types of walks that are 
of interest in graph theory: 

DEFINITION 13.7. Let G be a graph. 

(i) A closed walk is a walk that begins and ends at the same vertex. 

(ii) A closed walk in which no edges repeat is called a circuit. 

(iii) A circuit in which no vertices repeat (except the first and last) is 
called a cycle. 


EXAMPLE 13.4. In the graph shown in Figure 13.7, the walk that corresponds 
to following edges 1-7 in order is a circuit that is not a cycle, and the walk that 
corresponds to following edges 1-4 in order is a cycle. 


2 1 5 



FIGURE 13.7: A circuit (1-7) and a cycle (1-4). 


j 
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DEFINITION 13.8. Let G be a graph. 

(i) If G has no cycles, then we say that G is acyclic. 

(ii) If G has at least one cycle, then the girth of G is the minimum 
number of edges in any cycle in G. 

For example, if m, n > 2, then the girth of the complete bipartite graph 
K Him is 4, and if n > 3, then the girth of the complete graph K n is 3. 

The following theorem provides a condition that guarantees that a graph 
has at least one cycle: 

THEOREM 13.9. If each vertex in a graph G has a degree of at least 2, then 
there is a cycle in G. 

Proof. Pick an edge eo of G. Let uo and t>i be the vertices incident to co¬ 
if vo = V ], then we are done, since no. e 0. ni is then a cycle. Otherwise, 
let e\ be an edge incident to v\ and not equal to eo- This is possible since 
each vertex has a degree of at least 2. Now set V 2 equal to the vertex 
joined to iq by e\. If V 2 = v\ or V 2 = no. then, as before, we are done. 
Otherwise, pick an edge e 2 incident to V 2 and not equal to e\. 

We continue this process. Since there are only finitely many vertices 
in G, at some point we must have a walk, 

no, eo, ni, e\,... , n m _i,e m _i, v m , 

with no repeating edges, with no repeating vertices up through n m _i, and 
withn m = Vk for some k between 0 and m — 1. Then Vk , ek, ■ • ■ , e m _i, n m 
is a cycle in G. ■ 


DEFINITION 13.10. Let G be a graph. An Eulerian walk in G is a walk 
that visits each edge in G exactly once. An Eulerian circuit in G is a closed 
Eulerian walk in G. We say that G is Eulerian if it contains an Eulerian circuit. 


In Figure 13.8, an Eulerian walk is depicted in the graph on the left (follow¬ 
ing the edges in numerical order as shown) and an Eulerian circuit is depicted 
in the graph on the right. 



FIGURE 13.8: An Eulerian walk and an Eulerian circuit. 
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The Konigsberg bridges problem, introduced in Section 0.2, asks if there 
is an Eulerian walk in the associated graph K that models the land regions 
(vertices) and bridges (edges) in Konigsberg. (See Figure 13.9.) 




FIGURE 13.9: Modeling Konigsberg with graph K. 

In his solution to the Konigsberg bridges problem (without using the yet-to- 
be-developed language of graph theory), Leonhard Euler essentially identified 
necessary and sufficient conditions for a graph to possess an Eulerian walk 
or circuit, and he proved the necessity. We state the graph theory version of 
Euler’s result in the following theorem: 

THEOREM 13.11. Let G be a graph that is connected. 

(i) There is an Eulerian circuit in G if and only if all of the vertices of 
G have even degree. 

(ii) There is an Eulerian walk, but not an Eulerian circuit, in G if and 
only if G has exactly two vertices with odd degree. Such a walk 
must begin at one of the vertices with odd degree and must end at 
the other. 

We prove the necessity implication from Theorem 13.1 l(i); that is, we 
prove that if a graph has an Eulerian circuit, then all of the vertices of G have 
even degree. The converse is the subject of a set of supplementary exercises at 
the end of the section. The proof for (ii) is similar to that for (i). 

Proof of Necessity in (i). Suppose G is a graph and 


vo. eo,vi,e\,... , v„_i, e n -\, v n = vo 

is an Eulerian circuit in G. We prove that every vertex of G has even 
degree. 

Let v be a vertex, other than vq, in the circuit. Every time v is visited 
on the circuit, it is approached on one edge incident to it and departed 
from on another. Each of these approaches and departures contributes 1 
to the degree of v, and therefore each visit to v counts 2 toward its degree. 
Since each edge is visited exactly once on a circuit, the degree of v is 2 
times the number of visits to v on the circuit. Consequently, the degree 
of v is even. 
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For the vertex t’o, the same argument implies that the degree is 1 (for 
edge eo) plus 1 (for edge e„_i) plus 2 times the number of visits to up 
between the start and end of the circuit. Thus the degree of uq is also 
even. ■ 


Theorem 13.11 provides us with a solution to the Konigsberg bridges prob¬ 
lem. Since the Konigsberg graph K has four vertices of odd degree, it follows 
that there is neither an Eulerian circuit nor an Eulerian walk in K. Therefore 
it is not possible to take a walk through Konigsberg that crosses each bridge 
exactly once. 

Exercises for Section 13.1 

13.1. Let G be a graph. Prove that G is compact and Hausdorff. 

13.2. Determine how many edges there are in the graphs K„ and K„ M . Justify your 
assertion. 

13.3. Prove Theorem 13.3: Let G and G' be graphs with vertex sets Vq and Vq\ 
respectively. If there is a homeomorphism h : G —> G' that maps Vc bijectively 
to Vq', then G and G' are isomorphic and the function hy : Vc —>■ Vg', defined 
by hy(v ) = h(v), is a graph isomorphism. 

13.4. Prove that if G is a graph, then every component of G is a graph. 

13.5. (a) For which n are the graphs K n Eulerian? Justify your answer. 

(b) For which m and n are the graphs K mM Eulerian? Justify your answer. 

13.6. Let G be a graph. Prove that G has an even number of vertices having odd 
degree. 

13.7. Draw all of the acyclic graphs (up to graph isomorphism) having eight or fewer 
edges. Indicate which of these are homeomorphic. 

13.8. Let G be an acyclic graph having V vertices and E edges. Prove that E < V. 

13.9. (a) Prove that if any one of the seven bridges is removed from Konigsberg, 

then it is possible to walk through the city crossing each bridge exactly 
once. 

(b) Prove that if a new bridge is built between any two distinct land regions 
in Konigsberg, then it is possible to walk through the city crossing each 
bridge exactly once. 

(c) Prove that you can take any one of the seven bridges in Konigsberg and 
move it to a new location so that after the move has been made, it is possible 
to walk through the city crossing each bridge exactly once, beginning and 
ending in the same place. 

13.10. For each graph shown in Figure 13.10, use Theorem 13.11 to determine whether 
the graph has an Eulerian walk, an Eulerian circuit, or neither. If the graph has 
an Eulerian walk or an Eulerian circuit, sketch it on the graph. 
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FIGURE 13.10: In each, is there an Eulerian walk or an Eulerian circuit? 

Supplementary Exercises: The Existence of an 
Eulerian Circuit 

In these exercises we prove the sufficiency implication in Theorem 13.1 l(i). That is, 
we prove that if G is a connected graph and every vertex in G has even degree, then 
there is an Eulerian circuit in G. The proof proceeds by induction on the number of 
edges in G. 

SE 13.11. Prove the theorem in the case where G has one edge. 

Next, assume that G has n edges and that the result is true for graphs having between 
one and n — 1 edges. We prove that there is an Eulerian circuit in G. By Theorem 13.9 
there is a cycle 


no, eo,v\,e\,... , Vj-i,ej-i,Vj = no 

in G. Each edge in the cycle is visited exactly once, but the cycle need not visit every 
edge in the graph. Let E* be the set of edges in the cycle, and let V* be the set of 
vertices in the cycle that are incident only to edges in E*. Set G' equal to the graph 
obtained from G by removing all of the edges in E* and all of the vertices in V*. 

SE 13.12. Prove that each vertex in G' has even degree. 

Now, G' is not necessarily connected, but each component of it is. Let C be a component 
of G'. It follows that C is a graph (see Exercise 13.4) and is connected. Furthermore, 
C has between one and n — 1 edges, and each vertex in C has even degree. Therefore, 
by the induction hypothesis, C has an Eulerian circuit. 

SE 13.13. Prove the existence of an Eulerian circuit in G. 

Thus, by induction, it follows that if every vertex in a graph G has even degree, then 
G has an Eulerian circuit. 


13.2 Chemical Graph Theory 

Graph models are used extensively in chemistry. Typically, the atoms in a 
molecule are represented by the vertices in a graph, and the bonds are rep¬ 
resented by edges. A fundamental goal is to determine the properties of a 
molecule that can be predicted from the properties of a graph model of it. In 
this section, we present a particular example where the boiling point of a class 
of hydrocarbons, known as alkanes, can be accurately predicted using graph 
models of the molecules. 
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In chemistry, a descriptor is a modeling tool that is used to predict properties 
of molecules from quantitative aspects of molecular structure. The collection 
of descriptors yields what is known as the quantitative structure-property re¬ 
lationship (QSPR). Hundreds of QSPR descriptors have been identified for 
the analysis of molecules, and they fall into five general categories: constitu¬ 
tional, topological, electrostatic, geometrical, and quantum-chemical. Some 
of the topological descriptors are derived from properties of graphs model¬ 
ing molecules. We present one of them here, the Wiener index, named after 
Harry Wiener who first proposed its use in chemical modeling in 1947. Our 
presentation of this application is based on the article [Rou]. 

DEFINITION 13.12. Let G be a graph. 

(i) For vertices v and v' in G, we define the distance between v and 
v' to be the minimum number of edges in a walk from v to v' in G. 

(ii) The Wiener index ofG, denoted W ( G), is the sum of the distances 
between each pair of vertices in G. 


EXAMPLE 13.5. In Figure 13.11, four graphs are shown along with their 
Wiener indices. 





FIGURE 13.11: Wiener indices of graphs. 


The graphs C and D shown in Figure 13.11 are topologically equivalent, 
but their Wiener indices are not equal. Thus the Wiener index is not preserved 
under topological equivalence. However, as the following theorem indicates, 
it is preserved under graph isomorphism: 


THEOREM 13.13. If graphs G and G' are isomorphic, then W (G) = W (G 1 ). 
Proof. See Exercise 13.15. ■ 


We apply the Wiener index to the class of molecules known as alkanes. 
These are straight-chain or branched-chain hydrocarbons in which each bond 
between carbon atoms is a single, covalent bond. (What we mean by straight- 
chain and branched-chain should become clear in what follows.) 
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The simplest alkanes are methane, ethane, and propane. These are all 
straight-chain hydrocarbons. Representations of each are shown in Figure 13.12. 


H 

H H 

H H H 

hCh 

hC-Ch 

h C-C-Ch 

H 

H H 

H H H 

methane 

ethane 

propane 


FIGURE 13.12: The alkanes methane, ethane, and propane. 

In an alkane, each carbon atom makes a total of four bonds with other 
carbon and hydrogen atoms. There is a backbone of bonded carbon atoms, and 
the remaining bonds for each carbon atom are taken up by hydrogen atoms. 

From a particular alkane representation, we can form representations that 
have an additional carbon atom by inserting, between a carbon and hydrogen in 
the original representation, a new carbon with two hydrogens attached. This is 
an operation on representations, rather than a chemical process. Thus from the 
representation for propane, we can form representations of two different alka¬ 
nes containing four carbons: butane and 2-methylpropane. (See Figure 13.13.) 
The molecule 2-methylpropane is the simplest branched-chain alkane. 
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2-methylpropane 


FIGURE 13.13: The two four-carbon alkanes: butane and 2-methylpropane. 

From the alkanes with four carbons, we can construct three five-carbon 
alkanes, as shown in Figure 13.14. 
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FIGURE 13.14: The three five-carbon alkanes. 
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We model the alkanes with graphs such that each carbon atom corresponds 
to a vertex and each bond between carbon atoms corresponds to an edge. Thus 
the bonded-carbon backbone determines the graph. We ignore the hydrogen 
atoms and the hydrogen-carbon bonds since they are implicit, given the ar¬ 
rangement of the carbon atoms. The graph models for the alkanes that contain 
between two and five carbon atoms are shown in Figure 13.15. 


ethane 


propane 


n-butane 



2-methylpropane 


n-pentane 



2-methylbutane 


2,2-dimethylpropane 


FIGURE 13.15: Graph models of the alkanes with between two and five carbon 
atoms. 

We can construct the graph models for alkanes having six or more car¬ 
bons by appropriately adding vertices and edges to given alkane graphs. In 
Figure 13.16, we show the graph models for all of the six- and seven-carbon 
alkanes. 



2-methylpentane 3-methylpentane 



2,3-dimethylbutane 2,2-dimethyIbutane 


n-heptane 



2-methylhexane 3-methyIhexane 2,2-dimethylpentane 



3,3-dimethylpentane 2,3-dimethyIpentane 2,4-dimethylpentane 


3-ethylpentane 2,2,3-trimethylbutane 


FIGURE 13.16: Graph models of the alkanes with six or seven carbon atoms. 
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In the table in Figure 13.17, we list all of the alkanes containing between 
two and seven carbon atoms, and for each we include the boiling point of the 
molecule (in Kelvin) and the Wiener index of the graph representation of the 
molecule. 


Name 

WI 

BP (K) 

ethane 

1 

184 

propane 

4 

233 

n-butane 

10 

272 

2-methylpropane 

9 

261 

n-pentane 

20 

309 

2-methylbutane 

18 

301 

2,2-dimethylpropane 

16 

283 

n-hexane 

35 

342 

2-methylpentane 

32 

333 

3-methylpentane 

31 

336 


2,3-dimethylbutane 

29 

331 

2,2-dimethylbutane 

28 

323 

n-heptane 

56 

371 

2-methylhexane 

52 

363 

3-methylhexane 

50 

365 

2,2-dimethylpentane 

46 

352 

3,3-dimethylpentane 

44 

359 

2,3-dimethylpentane 

46 

363 

2,4-dimethylpentane 

48 

354 

3-ethylpentane 

48 

366 

2,2,3-trimethylbutane 

42 

354 


FIGURE 13.17: Boiling point and Wiener index data for alkanes. 

In Figure 13.18, we display a graph of the boiling point and Wiener in¬ 
dex data. It is apparent that there is a strong correlation between these two 
variables. This is the sort of relationship that is beneficial in chemical model¬ 
ing, a relationship where a quantitative property of a mathematical model of a 
molecule is closely related to a physical property of the molecule itself. 
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FIGURE 13.18: A graph of the boiling point and Wiener index data. 

The boiling point versus Wiener index relationship can be closely approx¬ 
imated by an increasing curve. Fitting a power equation B = a W& to the data, 
we find that the relationship between boiling point (B) and Wiener index (W) 
for the data is approximated by B = 181 W °- 1775 . (See Figure 13.19.) 

Octane is the straight-chain alkane with eight carbon atoms. The Wiener 
index of its graph representation is 84. Using our power equation, we obtain 
397K as an approximation to octane’s boiling point. This is in good agreement 
with the known boiling point of 399K. 
















































































Section 13.2 Chemical Graph Theory 421 



FIGURE 13.19: Fitting a power equation to the data. 


Of course, it is not in “predicting” the properties of molecules whose at¬ 
tributes are already known that topological tools such as the Wiener index 
are useful. It is in predicting the properties and behaviors of molecules yet 
to be synthesized where such modeling approaches are beneficial. “Will the 
molecule behave as intended?” and “Will the molecule have minimal toxic¬ 
ity?” are important questions in chemical design. It can be time-consuming 
and expensive to synthesize molecules and test them in order to answer these 
questions. The quantitative structure-property relationship and the modeling 
tools within it provide important information to help anticipate a molecule’s 
characteristics. 

Exercises for Section 13.2 

13.14. (a) Determine the Wiener index of the complete graph on n vertices, K„. 

(b) Determine the Wiener index of the complete bipartite graph, K n m . 

13.15. (a) Prove Theorem 13.13: If graphs G and G' are isomorphic, then W(G) = 

W(G'). 

(b) Show that the converse of Theorem 13.13 does not hold. That is. find 
graphs G and G’ demonstrating that W (G) = W(G') does not necessarily 
imply that G is isomorphic to G'. 

13.16. (a) Determine the Wiener index of the graph S n C K whose vertices are the 

integers 1, 2, 3,... , n, and whose edges are the intervals connecting them, 
[1,2], [2, 3].[n — 1, n], 

(b) Develop a relationship between the boiling point and number of carbon 
atoms, n, for the straight-chain alkane A„ whose graph representation is 
given by the graph S„ in (a). 

(c) Nonane is the straight-chain alkane with nine carbon atoms. Use your 
relationship from part (b) to approximate the boiling point of nonane. 
Compare your result with the actual value of 424K. 

(d) Decane is the straight-chain alkane with 10 carbon atoms. Use your rela¬ 
tionship from part (b) to approximate the boiling point of decane. Compare 
your result with the actual value of 447K. 
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13.17. In the table in Figure 13.20, we list all of the alkanes containing eight carbon 
atoms, along with their boiling points and associated Wiener indices. Add this 
data to the data for alkanes containing between two and seven carbon atoms, and 
use data-analysis software to derive a power equation B = aW^ approximating 
the boiling point versus Wiener index relationship for alkanes having between 
two and eight carbon atoms. 


Name 

WI 

BP (K) 

«-octane 

84 

399 


79. 



76 

392 

VRHlilSHi 

75 

391 

2,2-dimethylhexane 

71 

380 

2,3-dimethylhexane 

70 

389 

2,4-dimethylhexane 

71 

382 

2,5-dimethylhexane 

74 

382 


3,4-dimethylhexane 

68 

391 

3,3-dimethylhexane 

67 

385 

3-ethylhexane 

72 

392 

2,2,3-trimethylpentane 

63 

383 

2,3,4-trimethylpentane 

65 

386 

3-ethyl-2-methylpentane 

67 

389 

2,2,4-trimethylpentane 

66 

372 

2,3,3-trimethylpentane 

62 

388 


64 

391 


FIGURE 13.20: Boiling point and Wiener index data for the eight-carbon alkanes. 


13.3 Graph Embeddings 

Graph-embedding questions and results play a major role in topological graph 
theory. In this section we introduce a few examples of such results, including 
Kuratowski’s Theorem, which provides necessary and sufficient conditions for 
a graph to be embeddable in the plane. 

To begin, we have the following theorem, which indicates that no matter 
how many vertices and edges a graph has, there is enough room in R 3 to create 
a copy of the graph: 

THEOREM 13.14. Every graph G can be embedded in R 3 . 

Proof. Let G be a graph with n vertices, tq,... , v n , and m edges, 
«i,... , e m . We define an embedding / : G -*■ R 3 . Begin by defin¬ 
ing f{vi) = (0, 0, *), for each i = 1,... , n. In this manner, the set of 
vertices is mapped bijectively to a set of points on the z-axis in R 3 . 

Let Pi,... , P m be a set of distinct half planes in R 3 , each emanating 
from the z-axis. Now, if e^ is an edge in G, joining vertices «,• and Vj , then 
extend / to ek so that it maps e* homeomorphically onto the semicircle 
in Pk running from (0, 0, i) to (0, 0, j). (See Figure 13.21.) The map /, 
so defined, is an embedding of G in K 3 . ■ 

Given that all graphs can be embedded in M 3 , we next consider embeddings 
in the plane. Can we still obtain an embedding for every graph? We will see 
that the answer is no, motivating the following definition: 


DEFINITION 13.15. A graph G is called planar if it can be embedded in the 
plane; otherwise it is called nonplanar. 
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FIGURE 13.21: Theedgee^ maps homeomorphically to a semicircle in half plane P 

In Section 13.1 we introduced the three-utilities problem, which essen¬ 
tially asks if it is possible to embed the graph £ 3,3 in the plane. After some 
experimentation, it is apparent that the three-utilities problem does .not have a 
solution, and therefore we conjecture that ^3 3 is nonplanar. If that is the c.ase, 
then it follows that no graph K mn with m, n > 3 is planar because ^3 3 can 
be embedded in each such graph. 

On the other hand, we can see from Figure 13.22 that all complete bipartite 
graphs K\ n and are planar. 



FIGURE 13.22: The complete bipartite graphs K and K 2 ,„ are planar. 

Now, consider the complete graphs. For n < 4, we can embed K n in the 
plane as illustrated in Figure 13.23. But, if we experiment with K$, we are 
led to conjecture that K 5 is nonplanar. And if that is the case, then it follows 
that none of the complete graphs K n with n > 5 are planar since K$ can be 
embedded in each of those graphs. 

Thus it seems as if # 3 , 3 , K 5 , and every graph containing either ^ 3,3 or 
K 5 as a subspace are nonplanar. The following important topological graph 
theory result indicates that these are exactly the nonplanar graphs: 


THEOREM 13.16. KuratOWSki’s Theorem. A graph is planar if and 
only if it contains no subspace homeomorphic to £ 3,3 and no subspace home- 
omorphic to Ks. 
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FIGURE 13.23: The complete graphs K^, K 3 , and £4 are planar. 

Kazimierz Kuratowski (1896-1980) was a major contributor to the devel¬ 
opment of the field of topology in the early twentieth century. In his 1930 paper, 
“Sur le probleme des courbes gauches en topologie,” he proved the foregoing 
theorem that now bears his name. Since 1930 a number of alternative and 
simplified proofs of the theorem have been published. (See [Tho] and [Mak], 
for exc tuple.) 

Here we prove half of Kuratowski’s Theorem—that if a graph contains a 
subspace homeomorphic to either £3 3 or K 5 then it is nonplanar. We do not 
prove the other implication of Kuratowski’s Theorem—that if a graph G is 
nonplanar, then either K 3 3 or K 5 can be embedded in G; it would take us too 
far off course to introduce all of the details needed to do so. 

If G is an embedded graph in the plane, then the complement of G is made 
up of a number of components; we call these the faces of the embedded graph. 
(See Figure 13.24.) Furthermore, since G is compact, it is a bounded subset 
of the plane, and therefore there is exactly one unbounded face of G. (See 
Exercise 6.19.) 



FIGURE 13.24: A graph in the plane separates the plane into faces, /,-. 

Consider the planar graphs in Figure 13.24 and Figure 13.25. If we count 
vertices, edges, and faces, and let the total of each be V , E, and F, respectively, 
then in all three cases we obtain the relationship V — E + F = 2. It is not a 
coincidence that the result of V — E + F is 2 in each of these examples. This 
relationship is a consequence of Euler’s formula for planar graphs, a result that 
we establish in Theorem 13.18. 

Our proof of Euler’s formula is based on the next theorem, which is a 
general separation and nonseparation theorem for graphs in the plane that 
builds on the Jordan Curve Theorem (Theorem 11.2) and Theorem 11.10. 
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FIGURE 13.25: The relationship V - E + F = 2 holds for both of these graphs. 

Although the theorem is intuitively clear, like the Jordan Curve Theorem it is 
not straightforward to prove. We do not include a proof. 

The setup for the theorem is as follows: Let G be a graph in the plane. We 
are going to add an edge e to G to obtain a new graph G' in the plane. We 
assume that the vertices of e lie in G, but otherwise e does not intersect G. 
It follows that the interior of e (relative to G') lies in a face / of G. We are 
interested in knowing how the addition of e to the graph impacts the face /. 

THEOREM 13.17. Let G, e, f, and G' be as given. 

(i) If the vertices of e lie in separate components of G, then f — e is 
a face f of G', and e lies in the boundary off. 

(ii) If the vertices of e lie in the same component of G, then f — e is 
made up of two faces ofG', f and f", and e lies in the boundary 
of both f and f". 

Thus, in the first case in Theorem 13.17 the new edge e does not separate 
the face / of G, but in the second case it does. Note that in the second case 
it is possible that the vertices coincide. We illustrate the two cases from The¬ 
orem 13.17 in Figure 13.26, and we have drawn the edges as curves, rather 
than line segments, to emphasize the fact that this result is about general em¬ 
beddings of graphs in the plane, not just those whose image is made up of line 
segments. 



FIGURE 13.26: In the first case the new edge e does not separate /, but in the 
second case it does. 
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The relationship V — E + F — 2 holds for planar graphs that are connected. 
Euler’s formula for planar graphs, which we prove next, holds for all graphs 
in the plane. Note that if a graph is not connected, then it is a union of finitely 
many components, each of which is a connected graph. (See Exercise 13.4.) 


THEOREM 13.18. Euler’s Formula for Planar Graphs. Let G be a 

nonempty graph in the plane having C components. If there are V vertices, E 
edges, and F faces associated with G, then V — E + F = C+ 1. 


Proof. We prove the result by induction on the number of edges E. If 
E = 0, then G consists of V vertices and no edges. Thus G has V 
components; that is, C = V. Since the plane with a finite set of points 
removed is connected (see Exercise 6.43), G has a single face associated 
to it. Thus, F = 1. We have V — E + F= V + 1 = C + 1. Therefore 
Euler’s formula holds in the case where E = 0. 

Now assume that Euler’s formula holds for graphs G that lie in the 
plane and have E — 1 edges. Let G' be a graph that lies in the plane and 
has E > 1 edges. Assume that V, F, and C are the number of vertices, 
faces, and components, respectively, associated with G'. We consider 
two cases: either G' is acyclic or it is not. 

First assume that G' is acyclic. Pick a component H of G' containing 
at least one edge. The component H is an acyclic graph, and therefore 
it follows from Theorem 13.9 that H has a vertex of degree 1. Pick 
such a vertex v and let e be the edge incident tp it. Remove e from G' 
to obtain a graph G. The vertex v is a single-point component of G, 
and G has one component more than G' has. Furthermore, G has V 
vertices and E — 1 edges. If we add e back, obtaining G' from G, then 
Theorem 13.17 implies that the number of faces is the same for G' and 
G. Therefore G has F faces. Applying the inductive hypothesis to G, 
we have V — (E — 1) + F = (C + 1) + 1. Thus, in this case we obtain 
V — E + F = C + 1, as desired. 

Now assume that G' has a cycle. Let e be an edge in a cycle in G'. 
Remove e from G' to obtain a graph G. Since e lies in a cycle, the removal 
of e from G' does not change the number of components of the graph, and 
the vertices of e lie in the same component of G. Furthermore, G has V 
vertices and E — 1 edges. If we add the edge e back, obtaining G' from 
G, then Theorem 13.17 implies that the number of faces of G' is one more 
than the number of faces of G. Thus G has F — 1 faces. Applying the 
inductive hypothesis to G, it follows that V — (E — 1) + F — 1 = C + 1; 
therefore V — E + F = C + 1. 

In both cases we obtain V — E + F = C + 1, and therefore Euler’s 
formula holds for graphs that lie in the plane and have E edges, assuming 
that it holds for graphs that lie in the plane and have E — 1 edges. Thus, 
by induction, the theorem follows. ■ 
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A nice consequence of Euler’s formula for planar graphs is Euler’s formula 
for polyhedra. If we count the number of faces F, edges E, and vertices V of 
a polyhedron, as in Figure 13.27, then we obtain V — E + F = 2. We can 
see why this holds by removing a point from a face in the polyhedron and then 
mapping the resulting space homeomorphically to the plane. The vertices and 
edges of the polyhedron map to a connected graph in the plane, and the faces 
of the polyhedron map to the faces of the graph. Therefore, by Theorem 13.18 
it follows that V — E + F = 2. 



FIGURE 13.27: The relationship V - E + F = 2 holds for polyhedra. 


The following lemmas are subsequently employed to derive a useful bound 
(Theorem 13.21) on the number of edges in a planar graph: 

LEMMA 13.19. Let G be a graph in the plane. 

(i) If e is an edge that does not lie in a cycle in G, then e lies in the 
boundary of a single face of G. 

(ii) Ife is an edge that lies in a cycle in G, then e lies in the boundary 
of two faces of G. 

Proof. See Exercise 13.22. ■ 


LEMMA 13.20. Let G be a graph in the plane. If G has more than one face, 
then the boundary of every face of G contains a cycle. 

Lemma 13.20 does not imply that the boundary of each face is a cycle, but 
just that each face has a cycle in its boundary. Examining Figure 13.24, we 
see examples where the boundary of a face is not a cycle. In particular, in the 
boundary of face fa and in the boundary of face fa there are edges that are not 
part of any cycle in the graph, and in the boundary of face fa there are two 
cycles. 
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Proof. Assume that G has more than one face, and let / be a face of G. 
Let H be the graph made up of all of the edges and vertices of G that lie 
in the boundary of /. We claim that there is a cycle in H, and therefore 
there is a cycle of edges in G that lies in the boundary of /. 

First, we note that / is a face of H, and H has more than one face. 
Let p be a path from a point in / to a point in another face of H. If a 
is the last point on p in Cl(/), then a lies on an edge e of H, a lies in 
the boundary of /, and a lies in the boundary of at least one other face 
of H. It follows that e is in the boundary of more than one face of H, 
and therefore Lemma 13.19 implies that e lies in a cycle in H. Thus H 
contains a cycle, as we wished to show. ■ 

The following theorem provides us with a bound on the number of edges 
in a planar graph, expressed in relation to the number of vertices in the graph 
and the girth of the graph. Recall that the girth of a graph is the minimum 
number of edges in any cycle in the graph. 


THEOREM 13.21. Let G be a planar graph having V vertices, E edges, and 
at least one cycle. If g is the girth of G, and g > 3, then 


E < 


g(V~2 ) 
8-2 


Proof. Since G is planar, it can be embedded in the plane. We may 
assume that G itself lies in the plane. Each face associated with G has 
a cycle in its boundary (by Lemma 13.20), and each cycle consists of 
at least g edges; therefore we can think of each face as contributing 
at least g edges to G. But since each of these gF edges lies in a cycle, 
Lemma 13.19 implies that each such edge arises from two different faces. 
So the number of edges in G is at least gF/ 2. Hence gF/2 < E, and 
therefore F <2E/g. 

Substituting for F in Euler’s formula, we obtain 


2<C+l=V-E+F<V-£+ 


2 E 


Therefore, 


implying that 



V-2, 


gE-2E<g(V-2). 


Since g > 3, it follows that 


E < 


g(V~ 2) 

g~2 ‘ 
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COROLLARY 13.22. The graphs ^ 3,3 and K$ are nonplanar. 

Proof. For £ 3,3 we have V = 6 , E = 9, and g = 4. If we could embed 
K 3,3 in the plane, then by Theorem 13.21 we would have E < 2V — 4. 
But E and V do not satisfy this inequality; therefore £ 3,3 is nonplanar. 

Now consider K 5 . Here V = 5, E = 10, and g = 3. Since E and V 
do not satisfy the inequality E < 3 V — 6 , it follows that K 5 is nonplanar 
as well. ■ 


Corollary 13.22 yields half of Kuratowski’s Theorem. The graphs K 33 
and K 5 are nonplanar, and therefore if G is a graph containing either of these 
graphs as a subspace, then G is nonplanar as well. 


-i 

EXAMPLE 13.6. The Harary graphs # 4,6 and H 4 7 are shown in Figure 1 3.28. 
Are either of them planar? 




FIGURE 13.28: The Harary graphs 6 and # 4 , 7 . 

For # 4 , 6 , the answer is yes.- -In Figure 13.29 we show an embedding of 
# 4,6 in the plane, obtained simply by rearranging three of the edges in the 
representation of # 4,6 shown in Figure 13.28. 



FIGURE 13.29: An embedding of # 4,6 in the plane. 

For # 4 , 7 , the answer is no. In Figure 13.30 we illustrate an embedding of 
# 3,3 in # 4 , 7 . By Kuratowski’s Theorem, it follows that # 4,7 is nonplanar. 









Exercises for Section 13.3 

13.18. (a) Provide illustrations demonstrating that K 5 and £ 3,3 can be embedded in 

a Mobius band. 

(b) Provide illustrations demonstrating that K 5 and ^ 3,3 can be embedded in 
a torus. 

13.19. Let G be an acyclic graph. Prove that G is planar. 

13.20. Use Theorem 13.14 and Corollary 13.22 to prove thatIR 2 andR 3 arenothome- 
omorphic. 

13.21. Although # 3,3 is a simple graph that has six vertices and nine edges and is not 
planar, there exist other simple graphs that have six vertices and nine edges and 
are planar. Find one, and verify that (unlike ^ 3 , 3 ) it satisfies the inequality in 
Theorem 13.21. 

13.22. Prove Lemma 13.19: Let G be a graph in the plane. 

(a) If e is an edge that does not lie in a cycle in G, then e lies in the boundary 
of a single face of G. 

(b) If e is an edge that lies in a cycle in G, then e lies in the boundary of two 
faces of G. 

13.23. (a) Use Theorem 13.21 to prove that the Heawood graph H , shown in Fig¬ 

ure 13.31, is nonplanar. 

(b) Since the Heawood graph is nonplanar, Kuratowski’s Theorem implies 
that it must contain either a subspace homeomorphic to ^ 3,3 or a subspace 
homeomorphic to K 5 . Sketch such a subspace in a diagram of the Heawood 
graph. 



FIGURE 13.31: The Heawood graph H, and the Petersen graph P. 
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13.24. (a) Use Theorem 13.21 to prove that the Petersen graph P, shown in Fig¬ 

ure 13.31, is nonplanar. 

(b) Since the Petersen graph cannot be embedded in the plane, Kuratowski’s 
Theorem implies that it must contain either a subspace homeomorphic to 
^3 3 or a subspace homeomorphic to K$. Sketch such a subspace in a 
diagram of the Petersen graph. 

13.25. Construct a graph G as follows: Place a vertex on each square in a 4 x 4 
chessboard. Join two vertices with an edge if there is a knight move from 
the square associated with one vertex to the square associated with the other. 
(A knight move goes either two units horizontally and one unit vertically, or 
two units vertically and one unit horizontally.) Sketch the graph G, and either 
illustrate an embedding of G in the plane or prove that G is nonplanar. 

13.4 Crossing Number and Thickness 

In this section we introduce crossing number and thickness, two commonly 
studied measures of a graph’s failure to be planar. They are central topics in 
topological graph theory, and, as we show in an example, they are important 
considerations for the design of electronic circuits. 

Essentially, the crossing number of a graph is the minimum number of 
crossings needed in a drawing of the graph on paper. To make this formal, we 
need to be clear about what is meant by “drawing” and by “crossing.” 

First, a drawing of a graph G is a continuous function f : G -*■ R 2 . Of 
course, this notion of a drawing is too broad because it allows for the inclusion 
of fairly undesirable images of the graph in the plane. We want images that are 
as representative of the graph as possible, with as few crossings as possible. 
Thus, we restrict further: 

DEFINITION 13.23. A drawing f : G R 2 of a graph is called a good 
drawing if it satisfies the following conditions: 

(i) No point in the image of f corresponds to more than two points of 
the graph; 

(ii) There are only finitely many points in the image of f that corre¬ 
spond to two points of the graph. These are called crossing points 
of the drawing; 

(iii) No crossing point of the drawing corresponds to a vertex of the 
graph. 


THEOREM 13.24. Every graph G has a good drawing. 

Proof. Suppose we have a graph G. As in the proof of Theorem 13.14, 
we can embed G in R 3 so that the vertices of G map to points on the z-axis 
and the edges of G map to semicircles in half planes emanating from the 
z-axis. If we compose the embedding into R 3 with a projection onto a 
plane P in R 3 , we then have a drawing of G, but not necessarily a good 
drawing. However, if we choose P so that it is parallel to the z-axis and 
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is not perpendicular to any of the half planes containing the images of the 
edges of G, then the resulting drawing will satisfy, at least, requirements 
(ii) and (iii) to be a good drawing. Then, if there are points in the resulting 
drawing that correspond to more than two points in the graph, we can 
deform the corresponding semicircles slightly so that requirement (i) for 
a good drawing is satisfied as well. ■ 

Since all graphs have good drawings, we have the following definition. 

DEFINITION 13.25. Given a graph G, the crossing number of G, denoted 
v(G), is the minimum number of crossing points in any good drawing of G. 


The following theorem describes two essential facts concerning the cross¬ 
ing number: 

THEOREM 13.26. 

(i) Crossing number is a topological invariant of graphs; that is, if 
graphs G\ and G 2 are homeomorphic, then v(Gj) = v^Gf)- 

(ii) A graph can be embedded in the plane if and only if its crossing 
number is 0. 

Proof. The second part of the theorem is immediate. Consider the first 
part. Suppose that we have a good drawing d\ : G\ —> R 2 . We claim 
that there is a homeomorphism h : G 2 -> G\ such that the function d 2 , 
given by d 2 = d\ o h, is a good drawing of G 2 . Thus, for every good 
drawing d\ of G\ there is a corresponding good drawing t /2 of G 2 such 
that the images of d\ and d 2 are identical. It follows that the crossing 
numbers of G\ and G 2 are equal. 

To complete the proof, we need to establish the existence of the home¬ 
omorphism h : G 2 -*■ G\. Given h* : G 2 —> G\, a homeomorphism, 
the only way that d\ oh* could fail to yield a good drawing is if h* maps 
one or more degree -2 vertices of G 2 to points of G 1 that correspond to 
crossing points of the drawing d\. In such an instance, the homeomor¬ 
phism h* can be adjusted slightly to a homeomorphism h mapping all 
degree-2 vertices of G' to points of G that do not correspond to crossing 
points. (See Exercise 13.26.) ■ 


EXAMPLE 13.7. As we saw in the last section, each of the graphs K 5 , -£ 3 , 3 , 
and H 4 J is nonplanar, and therefore by Theorem 13.26 the crossing number 
of each is nonzero. In Figure 13.32 we show good drawings of each of these 
graphs, and in each case there is only one crossing. Thus the crossing number 
of each of these graphs is 1 . 




Section 13.4 Crossing Number and Thickness 433 



FIGURE 13.32: The graphs £ 5 , £ 33 , and H^j have crossing number 1. 
_ 1 


EXAMPLE 13.8. Consider the graph JC 3 4 shown in Figure 13.33. Since ^3 3 
embeds in £ 34 , the crossing number of £ 3,4 is at least 1. The good drawing 
of ^3 4 in Figure 13.33 has two crossings; therefore the crossing number of 
^ 3,4 is at most 2. Could the crossing number be 1 ? 



FIGURE 13.33: The graph K 34 has crossing number 2. 

We argue by contradiction that £ 3,4 cannot have crossing number 1. Sup¬ 
pose we have a good drawing of /C 3 4 with only one crossing. Then if we 
remove one of the edges involved in the crossing, the resulting graph is pla¬ 
nar (since we are eliminating the only crossing). However, if any one of the 
edges is removed from £ 3 , 4 , the result is still a graph in which ^ 3,3 can be 
embedded, and therefore ^ 3,4 with an edge removed is nonplanar. This is a 
contradiction, implying that there does not exist a good drawing of £ 3,4 with 
only one crossing. Thus the crossing number of £ 3,4 is 2. 


The problem of determining the crossing number for the complete bipartite 
graphs K m n is known as Turan’s brickyard problem, named after Paul Turan 
(1910-1976), a major contributor to graph theory. Turan encountered this 
problem in 1944, while working in a brick factory in a war labor camp near 
Budapest, Hungary. In [Tur], a welcome note in the first issue of the Journal 
of Graph Theory , he described the situation that led to his investigation of 
crossing numbers: 
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There were some kilns where the bricks were made and some open 
storage yards where the bricks were stored. All the kilns were 
connected by rail with all the storage yards. The bricks were carried 
on small wheeled trucks to the storage yards. All we had to do 
was put the bricks on the trucks at the kilns, push the trucks to 
the storage yards, and unload them there. We had a reasonable 
piece rate for the trucks, and the work itself was not difficult; the 
trouble was only the crossings. The trucks generally jumped the 
rails there, and the bricks fell out of them; in short, this caused a 
lot of trouble and loss of time which was rather precious to all of 
us (for reasons not to be discussed here). We were all sweating 
and cursing at such occasions, I too; but nolens-volens the idea 
occurred to me that this loss of time could have been minimized if 
the number of crossings of the rails had been minimized. But what 
is the minimum number of crossings? I realized after several days 
that the actual situation could have been improved, but the exact 
solution of the general problem with m kilns and n storage yards 
seemed to be very difficult and again I postponed my study of it to 
times when fears for my family would end. 

Although it has been investigated by many mathematicians since it was 
first posed by Turan, the problem of determining a general formula for the 
crossing number for K n m is still unsolved. 

While a graph with a crossing number of at least 1 cannot be embedded 
in the plane, the next theorem indicates that it can be embedded in a surface 
obrained by taking a connected sum of tori. (See Section 3.4.) 

THEOREM 13.27. If graph G has crossing number n, then G can be embed¬ 
ded in the connected sum of n tori. 

Proof. Take a good drawing of G with n crossings. Using the inverse of 
stereographic projection, we can map the good drawing from the plane 
to the sphere. In a neighborhood of each crossing in the resulting image 
in the sphere, put a tube (as illustrated in Figure 13.34), with one of 
the crossing strands running over the tube and the other crossing strand 
running through the tunnel in the surface that results from the addition 
of the tube. In this way the crossings have been removed, and the result 
is an embedding of G in a sphere with n tubes, a surface homeomorphic 
to a connected sum of n tori. ■ 



FIGURE 13.34 Removing a crossing by adding a tube. 
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Another measure of the failure of a graph to be planar is thickness, defined 
as follows: 

DEFINITION 13.28. The thickness of a graph G, denoted 0(G), is the small¬ 
est n such that G can be expressed as a union ofn planar graphs. 

It follows directly from the definition of thickness that a graph is planar if 
and only if its thickness is 1 . 


EXAMPLE 13.9. The graphs /C 3 3 and K$ have thickness 2. Why? Since 
these graphs are not planar, they each have a thickness of at least 2. Let K’ be 
the graph consisting of one edge from £ 33 , along with the vertices incident to 
that edge, and let K" be the graph consisting of the rest of the edges in £ 3 , 3 , 
along with all of the vertices in ^ 33 . Each of the graphs K' and K" is planar, 
and ^ 3,3 is the union of these two graphs; therefore 6 (Ks t f) = 2. Similarly, it 
follows that 0 (^ 5 ) = 2 . 


It is evident that thickness is invariant under graph isomorphism. However, 
thickness is not invariant under homeomorphism; that conclusion follows from 
the next two theorems. 


THEOREM 13.29. Let G be a nonplanar graph. There exists a graph G' 
that has thickness 2 and is homeomorphic to G. 

Proof. Consider the graph G' that is constructed from G by trisecting 
every edge in G, as in the proof of Theorem 13.5. (See Figure 13.6.) The 
graphs G and G' are homeomorphic. 

Now, let G 1 be the graph made up of all of the middle-third edges 
in G', along with all of the vertices incident to those edges. Let G 2 be 
the remainder of the edges in G', along with all of the vertices incident 
to those edges. Each of G \ and G 2 is planar (see Exercise 13.30), and it 
follows that G' has thickness 2. Q 


By Theorem 13.29, every nonplanar graph is homeomorphic to a graph 
with thickness 2. Does every nonplanar graph have thickness 2 itself? The 
answer is no; one consequence of the next result is that there are graphs with 
arbitrarily large thickness. (See Corollary 13.31 and Exercise 13.31.) 


THEOREM 13.30. Let G be a graph having E edges, V vertices, and girth 
g > 3. Then 


0(G) > 


E(g- 2) 
g(V- 2 )’ 





436 Chapter 13 Graphs and Topology 


Proof. Suppose that the thickness of G is 0 (G). Then G can be expressed 
as the union of 8 (G) planar graphs G;. We may assume that no two of the 
graphs Gi have an edge in common; otherwise we can remove a common 
edge from all but one of the graphs containing that edge, and we would 
still have G expressed as a union of 0(G) planar graphs. Let graph G; 
have Ei edges and V,- vertices. 

Suppose that graph G; has a cycle. If the girth of G,- is gi, then 
Theorem 13.21 implies that 


Et < 


gi(Vi- 2) < g(V — 2) 

gi -2 ~ g - 2 


The second inequality holds since V,- < V, gi > g, and the value of 
decreases as g increases. 

Now suppose that G, is acyclic. We claim that Ei < in this 

case as well. To begin, we note that £,• < V,- since G, is acyclic. (See 
Exercise 13.8.) Also, g < V generally holds, and therefore 

which implies that V < . Thus 


Ei <Vi <V < 


g(V- 2) 

g-2 ’ 


as we wished to show. 

Now, if we sum the inequalities Ei < over all Gi, then we 

obtain 


E < 0(G) 


g(V~ 2) 

g~2 


and from that the desired inequality follows. 


From Theorem 13.30 we obtain the following corollary regarding the thick¬ 
ness of the complete graphs and the complete bipartite graphs. 


COROLLARY 13.31. For K„ and K n ^ m , thickness is bounded from below as 
follows: 


(i) If n >3, then 9(K n ) > 

(ii) Ifn, m e Z+ and n + m >3, then 6(K n ^ m ) > 2 (n+m-2 ) 


Proof. See Exercise 13.29. 
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EXAMPLE 13.10. ByCorollary 13.31, the thickness of the complete bipartite 
graph K-jj is bounded from below by 

(7) (7) = 49 

2(7 + 7-2) 24' 

Therefore d(K-jj) > 3. In Figure 13.35 we present three planar graphs whose 
union is Kyj. In the figure, we view K-jj as the graph whose edges join each 
of the seven lettered vertices ( A-G ) to each of the other seven vertices. It 
follows that O(Kjj) = 3. 



FIGURE 13.35: The bipartite graph Kyj is the union of the three planar graphs 
shown. 


j 


EXAMPLE 13.11. An Application to Electronic Circuit Design. Printed 
circuit boards are one of the significant inventions that led to the rapid advance 
in electronic technology in the latter half of the twentieth century. A printed 
circuit board is a board (or collection of boards) holding electronic components 
that are connected to each other by conducting pathways, called etches or 
traces. Circuit boards are found in many electronic devices, from personal 
computers to DVD players to portable music players. They can hold up to 
thousands of components including resistors, capacitors, inductors, transistors, 
and integrated circuit chips (which themselves are miniature circuits). 

Each component on a printed circuit board has a number of wire leads that 
connect it to the conducting pathways. Components can have as few as one 
or two leads or, in the case of integrated circuit chips, as many as hundreds 
of them. The components are mounted on the printed circuit board and their 
leads are connected by conducting material, usually strips of copper that are 
etched into the board. 

Figure 13.36 illustrates a circuit diagram for a simple AM radio receiver. 
Usually, the characteristics of each component (for example, the voltage of a 
power supply) would be labeled in a such a diagram, but it is only the topological 
structure of the circuit that concerns us here. Notice that there is one place, 
toward the lower left in the diagram, where one conducting pathway passes 
over another. Is it possible to avoid having such an overpass by rearranging 
the circuit? We answer that question by considering a graph model. 
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FIGURE 13.36: A circuit diagram for an AM radio. 


A circuit intended for a printed circuit board can be naturally modeled by 
a graph. In Figure 13.37, we show a graph G that models the AM radio circuit 
in Figure 13.36. The vertices in G correspond to the circuit’s components 
and junctions (locations where conducting pathways meet). The edges in G 
represent the conducting pathways in the circuit. 



FIGURE 13.37: A graph model of the AM radio circuit. 

In Figure 13.38, we illustrate an embedding of £ 3,3 into the graph G. 
Therefore G is not embeddable in the plane, implying that every good drawing 



H 1 ) 

a 


FIGURE 13.38: The graph £33 embeds in G. 
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of G has at least one crossing point. It follows that there must be a conducting 
pathway overpass in any diagram of the circuit. Thus, construction of the AM 
radio circuit on a printed circuit board must involve drilling at least one pair of 
holes to allow a pathway to travel from, say, the top of the board to the bottom, 
under another pathway, and then back through to the top side of the board. 

Mounting the components and etching the conducting pathways are the 
easy, low-cost aspects of manufacturing a printed circuit board. Production 
costs rise when a circuit requires drilling holes to use both sides of a board or 
layering a number of boards together to use multiple flat surfaces. 

As we see in the AM radio circuit example, properties of graphs are helpful 
in planning a circuit layout and in determining how it can be constructed most 
economically on a printed circuit board. Crossing number and graph thickness 
are important graph properties that naturally play a role in printed circuit board 
design. The crossing number indicates the minimum number of crossings in 
a good drawing of a particular graph and therefore determines the minimum 
number of holes needed in a printed circuit board to pass circuit pathways under 
each other, ensuring no unwanted crossings. An alternative to drilling holes for 
crossings is to build the circuit in multiple layers, on the surfaces of a number 
of boards that are stacked together, insulated from each other, but electronically 
connected. For this approach, graph thickness provides an indication of the 
minimum number of surfaces needed to build a layered circuit. Thus, both 
crossing number and thickness can be valuable tools in designing and analyzing 
an electronic circuit so that it can be manufactured as economically as possible. 


Exercises for Section 13.4 

13.26. (a) In R, assume that a i <b\ < c i and <22 < £2 <C 2 - Prove that there exists 

a homeomorphism h : [<a;, c 1 ] —*■ [ 02 , 02 ] such that h{b\) = £> 2 - 
(b) Discuss the role that part (a) plays in the proof of Theorem 1 3.26, where 
it is asserted that the homeomorphism h* can be adjusted slightly to a 
homeomorphism h mapping all degree-2 vertices of G' to points of G that 
do not correspond to crossing points. 

13.27. Prove that for a graph G the thickness and crossing number are related as 
follows: 0(G) < 1 -I- rv(G)/81 where [Aj denotes the least integer greater 
than or equal to x. 

13.28. Prove that the bipartite graphs K^.n, with n > 3, and K^_ m , with m > 4, have 
thickness 2 . 

13.29. Prove Corollary 13.31: For K„ and K n . m thickness is bounded from below as 
follows: 

(a) If n > 3, then 0(K n ) > 5 ^ 35 ) • 

(b) If n, m € Z+ andn + m > 3, then6 (K„_ m ) > 2{ „™_ 2y 

13.30. Prove that the graphs G\ and G 2 , constructed in the proof of Theorem 13.29, 
are planar. 

13.31. Find a specific positive integer value k for which you can prove, using Corol¬ 
lary 13.31, that 9(Kicn) >: n for all n > 3. (This demonstrates that there 
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are graphs with arbitrarily large thickness. Specifically, given any n > 3, the 
complete graph Kk n is a graph with thickness at least n.) 

13.32. Consider the embedding of # 3,3 in the graph G illustrated in Figure 13.38. 
Sketch a diagram indicating how each individual edge in K 33 maps into G 
under the embedding. 

13.33. Consider the circuit diagram in Figure 13.39. Create a graph model of the circuit 
and prove that the circuit cannot be manufactured on one side of a circuit board. 



FIGURE 13.39: Prove that this circuit cannot be manufactured on one side of a 
circuit board. 

13.34. Consider the circuit diagram in Figure 13.40. Prove that the circuit can be 
rearranged so that two, but not all three, of the crossings can be removed. 



FIGURE 13.40: Prove that two, but not all three, of the crossings can be removed. 




Chapter 14 

Manifolds and Cosmology 


One of the great intellectual advances achieved by humanity was the realization 
of the shape of the planet on which we reside. It took hundreds of thousands 
of years of human development before we were able to conclude that we live 
on a sphere. 

Pythagoras posited a spherical earth as early as 500 B.C. A variety of 
observations suggested that the earth was curved. For example, masts were 
the last part of a ship to disappear on the horizon. Three hundred years later, 
Erastothenes used the shadows cast at noon in two different cities to estimate 
the curvature of the earth and from it, the earth’s radius. 

The fact that locally the surface of the earth is two-dimensional implies 
that it is a 2-manifold. We will define manifolds rigorously in the first section 
of this chapter. Other 2-manifolds we have already encountered include the 
torus, the Klein bottle, the projective plane, and the plane. 

A priori knowing the local two-dimensionality of the earth’s surface did 
not preclude the possibility that some 2-manifold other than a sphere corre¬ 
sponded to the earth. A torus earth certainly would have been interesting. (See 
Figure 14.1). 



FIGURE 14.1: A torus Earth. 


Since our realization of the shape of the earth, we have determined many 
other aspects of our universe. But one fundamental question remains unan¬ 
swered. What is the shape of the universe within which we reside? Here, 
we mean the spatial universe, which can be taken as a spatial slice of four¬ 
dimensional space-time. We can picture the spatial universe as having been 
created at the time of the big bang and expanding in size since then. We would 
like to know what topological shape it is that is expanding. 

In order to make sense of this question mathematically, we need to examine 
the fundamental nature of the spatial universe. What are its basic defining 
properties? 
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Notice that no matter where we have been in the universe so far, if we 
choose a given spot and travel out from it a short distance (say three feet) in 
all directions, we enclose a space that, for all intents and purposes, resembles 
a ball in three-dimensional Euclidean space. (See Figure 14.2.) 



FIGURE 14.2: The set of points within three feet of a point is a ball. 

The space that we live in appears to be locally three-dimensional. We call 
such an object a 3-manifold. Just as there are a variety of 2-manifolds, any one 
of which satisfies the same properties locally as does the surface of the earth, 
there are a variety of 3-manifolds, any of which is a candidate for the shape of 
the universe. 

In this chapter, we discuss manifolds of dimensions one, two, and three 
in the first three sections, and in the last two sections we discuss how an 
understanding of 3-manifolds helps in the determination of the structure of the 
universe. We begin with the official definition of an n-manifold. 

14.1 Manifolds 

We want to capture the idea of a space that locally resembles real n-space E”. 

DEFINITION 14.1. An n-manifold is a Hausdorff topological space X with 
a countable basis such that each x e X has a neighborhood that is homeo- 
morphic to the open n-ball. 

The fact that the neighborhoods are only required to be homeomorphic 
to the open n-ball gives us substantial leeway. A homeomorphism between a 
neighborhood and an open ball in E" need not preserve distance; it only needs 
to deform the neighborhood to look like an open ball. As in Figure 14.3, the 



FIGURE 14.3 Each point on the torus has a neighborhood that is homeomorphic 
to an open disk. 
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torus has the property that each point has a neighborhood homeomorphic to 
an open disk in the plane, although the neighborhoods appear curved on the 
surface. 

The two additional conditions that the space be Hausdorff and that it have a 
countable basis ensure that an n-manifold is a reasonably well-behaved space. 
(See Exercises 14.1 and 14.2.) 

When considering n -manifolds for a given n, the most fundamental ques¬ 
tions are the classification questions. Can we make a complete list of all 
^-manifolds? And, if such a list exists and we are given a particular n-manifold, 
is there an algorithm that will determine which manifold it is on the list? 

This turns out to be a very difficult problem. It has been successfully 
completed for dimensions one and two. There has been substantial work in 
dimensions three and four, but a complete classification of 3-manifolds remains 
elusive. And it has been proven that there can never be an algorithm that 
determines whether or not two given 4-manifolds are homeomorphic. Thus 
the complete classification of 4-manifolds can never be attained. The same 
holds for the classification of n-manifolds for any n > 4. (See [Mar] and 
[Boo].) 

Let us start with 1-manifolds. 

DEFINITION 14.2. A connected 1-manifold is called a curve. 

Each point in a curve has a neighborhood that is homeomorphic to the open 
interval (—1, 1). There is a complete classification for 1-manifolds: 

THEOREM 14.3. Let X be a l-manifold. 

(i) If X is connected and compact, then X is homeomorphic to S 1 . 

(ii) If X is connected and noncompact, then X is homeomorphic to R. 

(iii) IfX is disconnected, then X is the union of a countable collection 
of components, each of which is homeomorphic to either S 1 or R. 

Proofs of (i) and (ii) can be found in [Chr], It is straightforward that (iii) 
follows from (i) and (ii). Specifically, if a 1 -manifold X is not connected, then it 
is a union of components, each of which is a connected 1-manifold. Therefore, 
X is a union of mutually disjoint subspaces that are each homeomorphic to 
either S 1 or R. Since a manifold must have a countable basis, there can be at 
most countably many such components in X. 

Since we now have a good understanding of 1-manifolds, let us turn to 
2 -manifolds. 

DEFINITION 14.4. A connected 2-manifold is called a surface. 

The plane is a surface. It is connected and Hausdorff. Given a point 
p € R 2 , the open ball of radius 1 centered at p is a neighborhood of p that is 
homeomorphic to the open disk. Furthermore, the plane has a countable basis 
given by the set of open balls of rational radius centered at points x = (x\, xf) 
with both x\ and X 2 rational. 
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Every connected open subset of the plane is also a surface. In this section 
and the next, our primary focus is on compact surfaces. 

THEOREM 14.5. The sphere, the torus, the Klein bottle, and the projective 
plane are compact surfaces. 

Proof. We discuss the proof for the Klein bottle. The proof is essentially 
the same for the other three spaces. We saw earlier that the Klein bottle 
is connected and compact, being the image of a connected and compact 
space under a quotient mapping. 

Assume that we have a Klein bottle that is constructed from the 
square [0, 1] x [0, 1] by gluing the edges in the usual way. As illustrated 
in Figure 14.4, a point x in the interior of the square has an open-disk 
neighborhood obtained by taking an open ball of appropriately small 
radius. For a point y on an edge of the square (but not a vertex) and 
the point y' on the opposite edge with which it is identified, a half-disk 
neighborhood of y and a half-disk neighborhood of y' glue together to 
form an open-disk neighborhood of the resultant point in the Klein bottle. 
Finally, a vertex of the square has a quarter-disk neighborhood. The ver¬ 
tex is identified with the other three vertices, each of which has a similar 
quarter-disk neighborhood. These four quarter-disk neighborhoods glue 
together to form an open-disk neighborhood of the resultant point in the 
Klein bottle. Hence, every point in the Klein bottle has an open-disk 
neighborhood. 



FIGURE 14.4: Each point in the Klein bottle has a neighborhood homeomorphic to 
an open disk. 

The Klein bottle is Hausdorff. Given two points in the Klein bottle, 
we can separate them using sufficiently small open-disk neighborhoods, 
glued half-disk neighborhoods, or glued quarter-disk neighborhoods, as 
needed. 

Finally, the square, [0, 1] x [0, 1], has a countable basis B obtained 
by taking the intersection of the square with every open ball in the plane 
having rational radius and centered at points with rational coordinates. 
From B, take the balls that lie in the interior of the square, the half balls 
that are centered on the edges and do not contain vertices, and the quarter 
balls centered on the vertices, then map them into the Klein bottle, gluing 
the half balls and quarter balls appropriately. The result is a countable 
basis for the Klein bottle. ■ 
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We would like to use various attributes of surfaces to distinguish between 
them. One of these is the concept of orientability. To get a handle on ori- 
entability, we start with the prototype nonorientable space, the Mobius band. 

A Mobius band has the property that it can reverse orientation in the fol¬ 
lowing sense: Choose a start point on the core curve of the Mobius band M, 
and take a clockwise rotation in a neighborhood of that point, as illustrated on 
M in Figure 14.5. Now, let us travel around the core curve, pulling the clock¬ 
wise rotation along with us. By the time we have returned to our starting point, 
our clockwise rotation has been reversed into a counterclockwise rotation. We 
have reversed our orientation. 

On the other hand, consider the clockwise rotation on the annulus A in 
Figure 14.5. If we pull the rotation around the core curve, it does not get 
reversed. 



FIGURE 14.5: Pulling directions of rotation around the Mobius band and the annu¬ 
lus. 


The property of possessing a curve that reverses orientation is held by every 
surface that contains an embedded copy of the Mobius band. This motivates 
the following definition: 

DEFINITION 14.6. A surface that contains an embedded Mobius band is 
called nonorientable. A surface that does not contain an embedded Mobius 
band is called orientable. 

As indicated by the darker shaded regions in the Klein bottle and in the 
projective plane in Figure 14.6, both of these surfaces contain an embedded 
Mobius band, and therefore they are both nonorientable surfaces. 



FIGURE 14.6: The Klein bottle and the projective plane contain embedded Mobius 
bands. 
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The next theorem indicates that orientability is a topological invariant. 
Therefore we can use orientability to help distinguish between surfaces. 

THEOREM 14.7. Let Si and S 2 be homeomorphic surfaces. Then Sj is 
orientable if and only if S 2 is orientable. 

Proof. See Exercise 14.4. ■ 

In general, it is difficult to prove that a surface is orientable by directly prov¬ 
ing that it does not contain an embedded Mobius band. Therefore we take a dif¬ 
ferent approach to demonstrate orientability, one that involves triangulations— 
decompositions of a surface into triangles. After we discuss triangulations and 
some helpful related results, we will come back to the matter of orientability. 

Let r be a triangular region in the plane, and let S be a surface. If / : r -> S 
is an embedding, then we say that /(r) is a triangle in S. Also, the images 
under / of the edges and vertices of r are called the edges and vertices, 
respectively, of /(r). (See Figure 14.7.) 



FIGURE 14.7: A triangle in the surface S. 


DEFINITION 14.8. A triangulation T of a compact surface S is a collection 
of finitely many triangles in S that covers S and is such that any two triangles 
in T either do not intersect, intersect in a vertex of both triangles, or intersect 
in an edge of both triangles. 

In Figure 14.8, we show triangulations of the sphere, torus, Klein bottle, 
and projective plane. 



FIGURE 14.8: Triangulations of compact surfaces. 


THEOREM 14.9. Every compact surface has a triangulation. 
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This result was first proved by Tibor Rado in 1925. The proof is quite 
technical, and therefore we do not pursue it here. Proofs can be found in 
Rado’s original paper, [Rad], and, more recently, in the text [Moi], 

The next definition and theorem will be important for our work with trian¬ 
gulations in this and in the next section. 


DEFINITION 14.10. Let T be a triangulation of a compact surface S. Then 

(i) A triangulation T' is called a subdivision ofT if every triangle in 
T can be expressed as a union of triangles in T'. 

(ii) A triangulation T* is said to be equivalent to T if there is a home- 
omorphism h : 5 -> S that maps the triangles in T homeomorphi- 
cally to the triangles in T*. 


THEOREM 14.11. Let 1\ and Tj be triangulations of a compact surface S. 
There exist triangulations T[ and T^ that are equivalent to each other and are 
subdivisions of T\ andTj, respectively. 

Proof. Suppose we have triangulations 7] and 7-j of a compact surface 
S. The triangulations are said to be in general position if the following 
conditions hold: 

(i) Every vertex of each triangulation lies in the interior of a 
triangle of the other triangulation; 

(ii) Every edge of each triangulation intersects every edge of the 
other triangulation in at most finitely many points. 

If T\ and Ti are in general position, then we can construct a triangulation 
V that is a subdivision of both T\ and 72. The following steps are used 
to construct T'\ we illustrate them in Figure 14.9: 

(i) The vertices in T\ and the vertices in T 2 become vertices in 

r. 

(ii) Each point of intersection between an edge in T\ and an edge 
in 72 becomes a vertex in T'. 

(iii) Each edge that is in T\ or in T 2 either becomes an edge in V 
or is subdivided into edges in T' by vertices from step (b). 

(iv) New vertices are added in edges to form triangles out of re¬ 
gions that are bounded by only two edges. 

(v) New edges are added in'regions bounded by more than three 
edges to subdivide such regions into triangles. 
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© New vertices 
-Added edges 



T i— T 2 


T' 


FIGURE 14.9: Constructing a subdivision T' fromtriangulations T\ and T 2 . 

The resulting triangulation T' is a subdivision of both 7j and T 2 , 
and therefore the theorem holds in the case that T\ and T 2 are in general 
position. 

Next, suppose that 7j and T 2 are not in general position. Then we 
can make small adjustments to one of the triangulations, say T \, to obtain 
a new triangulation T* that is equivalent to Zj but is such that T* and 
T 2 are in general position. That we can do this is plausible, but it is a 
technical result that takes a lot of detail to develop formally. Thus let 
T* be such a triangulation, and let / : S -*■ S be a homeomorphism 
mapping 7j to T*. Since T* and T 2 are in general position, we can, as 
before, find a subdivision T 2 of both T* and T 2 . Then we can map T 2 by 
/ _1 to a subdivision T[ of 7j. In this way, we have obtained subdivisions 
7j' and T 2 of T\ and T 2 , respectively, such that 7j' and T 2 are equivalent. 


As illustrated in Figure 14.10, a triangle can be assigned a direction of 
rotation that can serve as the direction of rotation at each of its points and that 
does not result in the orientation-reversal issues encountered on the Mobius 
band. 



FIGURE 14.10: A direction of rotation on a triangle. 

Also, as shown in Figure 14.10, a direction of rotation on a triangle induces 
a direction on each of its edges. This leads us to the following definition: 
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DEFINITION 14.12. Let S be a compact surface and T be a triangulation of 
S. The triangulation T can be coherently oriented if each triangle in T can be 
assigned a direction of rotation such that if two triangles intersect in an edge, 
then the directions the edge inherits from each of the triangles are opposite 
each other. (See Figure 14.11.) 



FIGURE 14.11: Triangles in a coherently oriented triangulation. 


In this definition, the condition that the inherited edge directions do not 
match ensures that we can slide an orientation from one triangle across an edge 
to an adjacent triangle and have the corresponding orientations agree. 

In Figure 14.12 we depict coherently oriented triangulations of the sphere 
and the torus. In Figure 14.8 the triangulation of the projective plane appears 
similar to the triangulation of the sphere. However, the coherent orientation 
of the triangulation of the sphere does not carry over to the projective plane 
because when we glue the top and bottom edges of the disk to obtain the 
projective plane, there are triangles whose edges are glued together so that 
the orientations do not properly match. A similar situation occurs with the 
triangulations of the torus and the Klein bottle. 



FIGURE 14.12: Coherently oriented triangulations of the sphere and torus. 

The following theorem establishes the equivalence between being ori- 
entable and having a coherent orientation: 


THEOREM 14.13. A compact surface S is orientable if and only if it has a 
triangulation that can be coherently oriented. 
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Proof. We first show that if one triangulation of a compact surface S can 
be coherently oriented, then every triangulation of S can be coherently 
oriented. Specifically, let 7j and T 2 be triangulations of S, and suppose 
that T\ is coherently oriented. By Theorem 14.11, there exist subdivisions 
T[ and r 2 ' of T\ and Tj, respectively, such that T[ and 7^ are equivalent. 
It is evident that 

(i) Subdivision T[ inherits a coherent orientation from the coher¬ 
ent orientation of T\. 

(ii) Triangulation obtains a coherent orientation as a result of 
its equivalence to 7j'. 

(iii) Triangulation T 2 obtains a coherent orientation from the co¬ 
herent orientation of its subdivision Tf 

Now, suppose that S is a nonorientable compact surface and therefore 
contains an embedded Mobius band, M. We need to show that every 
triangulation of S cannot be coherently oriented. By the first part of 
the proof, it suffices to show that there exists one triangulation of S 
that cannot be coherently oriented. Let T be a triangulation of S. We 
can adjust and subdivide T, in a manner similar to the construction in 
the proof of Theorem 14.11, so that in the resulting triangulation T' of 
S there is a collection of triangles, T' M , whose union is the embedded 
Mobius band M. The fact that we can reverse a direction of rotation 
by pulling it around the core curve in a Mobius band implies that the 
triangles in T' M cannot be coherently oriented. Thus, the triangulation T' 
of S cannot be coherently oriented. 

Next, suppose that S is orientable. We need to show that it has a 
triangulation that can be coherently oriented. Suppose it does not, and 
let T be a triangulation of S. Then T cannot be coherently oriented. 
Starting with any triangle in T, choose a direction of rotation on it. Extend 
this direction of rotation to the adjacent triangles in a coherent manner. 
Continue this process over the triangles in T. Because T cannot be 
coherently oriented, at some point we end up with a direction of rotation 
on a triangle that is not coherent with a rotation already assigned to 
an adjacent triangle. We can then find a sequence of distinct triangles 
T ],... , Tfi such that 

(i) For each i — 1, ... , n — 1, r t - and r,+i have an edge in 
common and are coherently oriented with each other; 

(ii) Triangles x n and t\ have an edge in common but the rotation 
on x n is not coherent with the rotation on x\. 

Consider the strip running across each triangle r, as shown shaded 
in Figure 14.13. If we regard the strip as being constructed by gluing the 
triangles together, one at a time, then when we glue r„ to x \, the strip 
is glued with a half twist, making it a Mobius band. Therefore there is 
a Mobius band embedded in S, contradicting the assumption that S is 
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orientable. It follows that S has a triangulation that can be coherently 
oriented. ■ 



FIGURE 14.13: The shaded strip glues to form a Mobius band in the surface. 

We pointed out previously that the sphere and the torus each have tri¬ 
angulations that can be coherently oriented. (See Figure 14.12.) Therefore 
Theorem 14.13 implies that the sphere and torus are orientable. Since ori- 
entability is a topological property, it follows that we can distinguish these two 
surfaces from the Klein bottle and the projective plane. Specifically, neither the 
sphere nor the torus is homeomorphic to either the Klein bottle or the projective 
plane because the former two are orientable while the latter two are not. As in 
this example, orientability is a fundamental property that is used to distinguish 
between compact surfaces. In the next section, we use it, along with the Euler 
characteristic, to distinguish between all compact surfaces. 

Up to this point, we have introduced a somewhat limited collection of 
compact surfaces, but we can dramatically expand that collection using the 
connected-sum operation, defined as follows: 

DEFINITION 14.14. Given two surfaces S\ and S 2 , the connected sum of 
Si and S 2 , denoted Si#S 2 , is the surface obtained by removing the interior of 
a disk from each surface and gluing the two circle boundaries together via a 
quotient map. (See Figure 14.14.) 



FIGURE 14.14: The connected sum of two surfaces. 
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The result of the connected-sum operation is a surface. (See Exercise 14.9.) 
Although we do not prove it here, up to homeomorphism the connected-sum 
operation is independent of the particular disks whose interiors we remove from 
the surfaces and the particular choice of a gluing we use on the two boundary 
circles. 

Notice that when we take the connected sum of the sphere S 2 with any 
other surface S, the result is homeomorphic to S. (See Figure 14.15.) If we 
think of connected sum as an additive operation on surfaces, S 2 acts like the 
identity. 



FIGURE 14.15: S 2 #S is homeomorphic to S. 


DEFINITION 14.15. The connected sum of a n tori, denoted nT, is called a 
genus n surface. A sphere is called a genus 0 surface. (See Figure 14.16.) 



n= 3 

FIGURE 14.16: The genus n surfaces, with n = 0,1,2,3. 


The genus n surface is a compact surface for all integers n > 0. Further¬ 
more, by the next theorem it follows that the genus n surface is orientable. 

THEOREM 14.16. Let Si and S 2 be compact surfaces. Then Sj#S 2 is ori¬ 
entable if and only if both Si and S 2 are orientable. 


Proof. See Exercise 14.10. 
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Exercises for Section 14.1 

14.1. The extra-point line was introduced and shown to be non-Hausdorff in Exer¬ 
cise 7.18. The underlying set is X =1U {p e }, where p e is an extra point, not 
contained in E. The topology on X is defined via the basis B consisting of all 
intervals (a, b) c E and all sets of the form (c, 0) U {p e } U (0, d) for c < 0 
and d > 0. 

(a) Show that every point in X has a neighborhood homeomorphic to an open 
interval. 

(b) Show that X has a countable basis. 

(The extra-point satisfies two of the conditions needed to be a 1-manifold, but 
it is not Hausdorff and therefore is not a 1-manifold.) 

14.2. Show that if Rd is E with the discrete topology, then Rj xK satisfies all of the 
properties of being a 1-manifold except that it does not have a countable basis. 

14.3. Let M be an n-manifold and assume x e M. Show that for each neighborhood 
U of x there is an open set B such that x e B c U and B is homeomorphic to 
the open n-ball. 

14.4. Prove Theorem 14.7: Let Si and S 2 be homeomorphic surfaces. Then S] is 
orientable if and only if S 2 is orientable. 

14.5. Although the Mobius band is not a surface, we can triangulate it as shown in 
Figure 14.17. Choose a direction of rotation on one of the triangles, and prove 
that it cannot be extended to a coherent orientation for the whole triangulation. 



FIGURE 14.17: A triangulation of the Mobius band. 

14.6. Show that every manifold is regular and therefore metrizable by the Urysohn 
Metrization Theorem (Theorem 5.24). (Hint: Use Exercise 14.3.) 

14.7. Create an example of a topological space that satisfies all of the conditions 
for being a 2-manifold except that it is not Hausdorff. (Hint: Consider Exer¬ 
cise 14.1.) 

14.8. Let M be a noncompact n-manifold. 

(a) Show that M is locally compact. (Therefore, the one-point compactifi- 
cation of M yields a compact Hausdorff space Y that contains M as a 
subspace and is such that Cl (M) = Y.) 

(b) Give an example of a 2-manifold M such that the one-point compactifica- 
tion of M is not a 2-manifold. 

14.9. Show that the connected sum of two compact surfaces is a compact surface. 

14.10. Prove Theorem 14.16: Let Si and S 2 be compact surfaces. Then S|#S 2 is 
orientable if and only if both Si and S 2 are orientable. 
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14.2 Euler Characteristic and the Classification of Compact Surfaces 

In this section, we continue our investigation of compact surfaces. We intro¬ 
duce the Euler characteristic, prove that it is an invariant for compact surfaces, 
and then use it—in conjunction with orientibility—to distinguish between com¬ 
pact surfaces. 

Given a triangulation of a compact surface 5, we can construct a polygon 
with pairs of edges to be identified that yields the surface. We begin with one 
of the triangles and perform a single-edge identification with another triangle. 
The result is topologically a disk. If there are remaining triangles, then because 
5 is a surface, at least one will glue to this disk along an edge on its boundary. 
Again, the result is topologically a disk. We continue this process until all 
triangles have been attached, resulting in a disk with pairs of edges in its 
boundary still to be identified. This is called a polygonal representation of 
S. The representations of the torus and the Klein bottle as squares with their 
edges glued together are polygonal representations. We saw other polygonal 
repimentations of surfaces in Section 3.4. 


EXAMPLE 14.1. In Figure 14.18 we see a polygonal representation of T#K, 
the connected sum of a torus and a Klein bottle. In the illustration of the 
resulting surface, we show how the glued edges appear. 


d 



b 

FIGURE 14.18: A polygonal representation of T#K. 


j 


Given a triangulation of a compact surface, we define its Euler character¬ 
istic as follows: 


DEFINITION 14.17. Let T be a triangulation of a compact surface S. We 
define the Euler characteristic of T by x(T) = V — E + F where V is the 
number of vertices, E is the number of edges, and F is the number of triangles 
in the triangulation. 

We use F for the number of triangles because we also think of them as 
faces, as if the triangulated compact surface is made up of triangular faces. The 
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expression V — E + F appears in both the Euler characteristic formula and 
Euler’s formula for planar graphs (Theorem 13.18). We discuss the connection 
between the two formulas later in this section. 

It turns out that the Euler characteristic is the same for every triangulation of 
a given compact surface. We express this in the following important theorem, 
which will be very useful in distinguishing compact surfaces: 

THEOREM 14.18. Let T\ and Tj be triangulations of a compact surface S. 
Then x(Ti) = x(T 2 ). 

Proof. Let 5 be a compact surface, and let T\ and T 2 be triangulations of 
S. First, if T\ and T 2 are equivalent, then it is straightforward that their 
Euler characteristics are the same. 

On the other hand, if T\ and T 2 are not equivalent, then by The¬ 
orem 14.11 there are equivalent triangulations Tl and Ti that are sub¬ 
divisions of T\ and r 2 , respectively. We claim that the Euler charac¬ 
teristic is preserved under subdivision. Given the claim, we then have 
X(T\) = x(T/), x(Ji) — x(T 2 ), and, since t[ and T 2 are equivalent, 
X(T[) = x(Tf). It follows that x(Ti) = x (T 2 ) in this case as well. 

Thus we need to show that the Euler characteristic of a triangulation 
T is preserved under subdivision. To do so, we think of T as a topological 
graph G in 5, made up of the vertices and edges of T. We can construct a 
subdivision T' of T by applying a sequence of operations to G, generating 
a sequence of topological graphs G = Go,.... G n , where G n is the 
topological graph corresponding to the subdivision T'. 

The operations are as follows; we illustrate them in Figure 14.19: 

(i) Add a new vertex that lies on an edge in G, to obtain G,+i. 
As a result, the edge in G, is subdivided into two new edges 
in Gj+i. 

(ii) Add a new vertex and a new edge to G,- to obtain G,+i. The 
added vertex and edge lie in the complement of G,, and the 
new edge connects the new vertex to a vertex in G,. 

(iii) Add a new edge to G; to obtain G,+i. The added edge lies in 
the complement of G,-, and it connects two vertices in G;. 

To show that the Euler characteristic is preserved under subdivision, 
we show that a sequence of these operations does not change the Euler 
characteristic. First, when a topological graph G is associated to a tri¬ 
angulation T of 5, then the vertices and edges of G correspond to the 
vertices and edges of T, and the components of the complement of G in 
S correspond to the triangles in T. Therefore, if we let V represent the 
number of vertices in G, E the number of edges, and F the number of 
components of the complement of G in 5, then V — E + F is the Euler 
characteristic. After each of the operations is applied to a topological 
graph, we do not necessarily have a topological graph that is associated 
with a triangulation, but we can still compute V — E + F for it. We claim 
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FIGURE 14.19: The operations on topological graphs for subdividing triangulations. 

that this value does not change under each of the operations, and therefore 
it follows that the Euler characteristic is unchanged under subdivision. 
Let us look at the effect of the operations on V — E + F : 

Operation (i): This operation adds a vertex to G; and subdivides an 
edge into two new edges. Therefore we have a net gain of one vertex and 
one edge, resulting in no change to V — E + F. 

Operation (ii): This operation adds a vertex and an edge to G, and 
leaves the number of components of the complement unchanged. There¬ 
fore we have a net gain of one vertex and one edge, resulting in no change 
to V — E + F. 

Operation (iii): This operation adds an edge to G, and splits one com¬ 
ponent of the complement into two new components. Therefore we have 
a net gain of one edge and one component of the complement, resulting 
in no change to V — E + F. 

It follows that the Euler characteristic is preserved under subdivision, 
and therefore we obtain the desired result: If T\ and T 2 are triangulations 
of a compact surface S, then x (Tj) = X (I 2 ). ■ 

Although intuitively apparent, it is nontrivial that operation (ii) in the fore¬ 
going proof leaves unchanged the number of components of the complement 
of the topological graph while operation (iii) splits one component into two 
new ones. These results follow from Theorem 13.17, a result that we did 
not prove but that generalizes the Jordan Curve Theorem (Theorem 11.2) and 
Theorem 11.10. 

Operations (i)—(iii) from the proof of Theorem 14.18 enable us to carry 
out the five steps that were used in the proof of Theorem 14.11 to construct 
a common subdivision of two triangulations in general position. Let T\ and 
T 2 be two such triangulations. If we take T\, then we can use operations (i)- 
(iii) to add in all of the vertices and edges (or subdivided edges) from T 2 to 
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superimpose the two triangulations. This essentially covers construction steps 
(i)-(iii) in the proof of Theorem 14.11. Then we can use operations (i)-(iii) 
again to insert vertices and edges (as described in steps (iv) and (v) in the 
proof of Theorem 14.11) to make triangles out of regions that are not already 
triangles. 

Now that we have Theorem 14.18, we can define the Euler characteristic 
of a compact surface S to be the Euler characteristic of any triangulation of 
S. We denote the Euler characteristic of S by x (S)- 

It is clear that if T is a triangulation of S, and / : S -> S' is a homeomor- 
phism, then / naturally maps T to a triangulation T' of S'. Furthermore, the 
Euler characteristic of T ' equals that of T. Thus we have the following theo¬ 
rem, which establishes that the Euler characteristic is an invariant for compact 
surfaces: 

THEOREM 14.19. If S and S' are homeomorphic compact surfaces, then 
X(S) = X(S'). 


EXAMPLE 14.2. From the triangulations in Figure 14.8, we see that x (S 2 ) = 
2, x(T) = 0, x(K) — 0, and x(P) = L 

These Euler characteristic values, along with orientability, enable us to 
distinguish these four spaces. By Theorem 14.19, it follows that S 2 is not 
homeomorphic to T, K, or P, that T is not homeomorphic to P, and that K 
is not homeomorphic to P. The only distinction not made here by the Euler 
characteristic is between T and K, but we already made that distinction in the 
last section, using orientability. 


The topological invariance of the Euler characteristic (Theorem 14.18) and 
Euler’s formula for planar graphs (Theorem 13.18) are closely related results. 
Theorem 14.18 applies to compact surfaces, but only addresses triangulations, 
while Theorem 13.18 only applies to the plane and the sphere, but addresses 
general topological graphs. Both results imply that the Euler characteristic of 
the sphere is 2. 

As we have done with S 2 ,T, K, and P, we can use the Euler characteristic 
and orientability to distinguish all compact surfaces. The following theorem 
is an excellent example of the kind of result we seek. It gives a complete 
classification of compact surfaces. 

THEOREM 14.20. Every compact surface is homeomorphic to exactly one 
of S 2 , T#T#... #T, or P#P#... #P. 

The compact surfaces listed in Theorem 14.20 are illustrated in Figure 14.20. 
Just as nT represents the connected sum of n tori, we use nP to denote the 
connected sum of n projective planes. 

In order to prove Theorem 14.20, it is necessary to show that every compact 
surface is homeomorphic to at least one of S 2 , nT, or nP, and that no two of 
the surfaces among S 2 , nT, and nP are homeomorphic. Later in this section 
we use the Euler characteristic and orientability to show that no two of the 
listed surfaces are homeomorphic. 
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FIGURE 14.20: The complete collection of compact surfaces. 

We do not prove that every compact surface is homeomorphic to at least one 
of the listed surfaces. Proofs can be found in [Mun] and [Mas]. The idea is that 
given a compact surface, we can perform a series of cut-and-paste operations 
on it to show that it is in the form of one of the listed surfaces. We introduced 
cutting and pasting in Section 3.4. We present another cutting-and-pasting 
example in Example 14.3. 

Considering all of the compact surfaces that we can construct using con¬ 
nected sums, it is surprising that they all fall into one of the types listed in 
Theorem 14.20. For example, the Klein bottle A" is a compact surface that 
does not appear to be among the listed surfaces. However, we showed in Ex¬ 
ample 3.25 that K = P#P. Of course, P#P is one of the listed surfaces, and 
therefore so is the Klein bottle. 


EXAMPLE 14.3. Another compact surface that does not appear to be among 
the surfaces listed in Theorem 14.20 is T#P. This is a nonorientable surface. 
Therefore, by Theorem 14.20, it must be a connected sum of projective planes. 
In fact, T#P = P#P#P. In Figure 14.21 we show by cutting and pasting that 
T#P = K#P. Since K = P#P, it follows that T#P = K#P = P#P#P. 


The relationship T#P = K#P from Example 14.3 demonstrates that there 
is no cancellation law for the connected sum of surfaces. Such a cancellation 
law would imply that T is homeomorphic to K given that T#P is homeomor¬ 
phic to K#P. However, T is not homeomorphic to K since T is orientable but 
K is not. 

The relationships P#P = K and T#P = K#P yield the following char¬ 
acterization of connected sums of projective planes: 


THEOREM 14.21. For every integer n > 3, 


z=±T#P 

T#K 


if n is odd, 
if n is even. 


Proof. See Exercise 14.11. 
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FIGURE 14.21: Using cutting and pasting to show that T#P is homeomorphic to 
K#P. 

The following alternative form of the classification theorem for compact 
surfaces is a straightforward consequence of Theorems 14.20 and 14.21: 

THEOREM 14.22. Every compact surface is homeomorphic to exactly one 
ofS 2 ,nT,nT#K,ornT#P. 

We allow the possibility of no Ts in the latter two connected sums in 
Theorem 14.22 to include K and P in our list. In Figure 14.22, we illustrate 
the compact surfaces listed in Theorem 14.22. 



FIGURE 14.22: The complete collection of compact surfaces. 
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As demonstrated with the sphere, torus, Klein bottle, and projective plane 
in Example 14.2, we can use orientibility and the Euler characteristic to dis¬ 
tinguish between different compact surfaces. In order to distinguish between 
all of the compact surfaces, we need to understand the effect of the connected 
sum operation on the Euler characteristic. 

LEMMA 14.23. If Si and S 2 are compact surfaces, then 

X(5i#5 2 ) = x(5i) + X (S 2 )-2. 

Proof Since the Euler characteristic is independent of the triangulation, 
we choose triangulations of Si and S 2 so that the disks whose interiors 
we remove in the process of taking the connected sum are triangles in 
the triangulations. Then the connected-sum operation corresponds to 
removing the interior of a triangle from each surface and identifying the 
boundaries of the triangles with each other, gluing vertices to vertices 
and edges to edges. But each boundary consists of three vertices and 
three edges. Hence, in the process of combining the triangulations of 
S 1 and S 2 to obtain a triangulation of Sj#S 2 , we lose three vertices, 
three edges, and two faces. The resulting Euler characteristic is then 
X (Si) + X (S 2 ) - 3 + 3 - 2. Therefore, X (Si#S 2 ) = x (Si) + X (S 2 ) - 2. 


THEOREM 14.24. For the compact surfaces n T and n P, the Euler charac¬ 
teristic is given by x(nT) = 2 — 2n and x(nP) = 2 — n. 

Proof. See Exercise 14.12. ■ 

By our results on orientability from the previous section, including Theo¬ 
rem 14.16, it follows that S 2 and nT are orientable while nP is not. Further¬ 
more, since x(S 2 ) = 2 and yfnT) = 2 — 2 n, it follows that no two of the 
compact surfaces among S 2 and n T have the same Euler characteristic. Sim¬ 
ilarly, no two of the compact surfaces n P have the same Euler characteristic. 
Thus, given any two of the compact surfaces listed in Theorem 14.20, either 
they do not have the same Euler characteristic or one is orientable and the 
other is not. Either way, we can conclude that no two of the compact surfaces 
listed in Theorem 14.20 are homeomorphic. We can draw the same conclusion 
for the compact surfaces listed in Theorem 14.22. This yields the following 
powerful result: 

THEOREM 14.25. Two compact surfaces are homeomorphic if and only if 
they have the same Euler characteristic and are either both orientable or both 
nonorientable. 

Theorems 14.20 and 14.25 finish off the classification problem for compact 
surfaces. We have a complete list of possibilities, with no repetition, and a 
simple algorithm (compute the Euler characteristic and identify whether or 
not the surface is orientable) for determining which compact surface listed in 
Theorem 14.20 corresponds to any given surface. 
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EXAMPLE 14.4. To see how the algorithm can be used to identify a compact 
surface, consider the surface S given by the connected sum K#K#P#P#T#T. 
It follows that S is nonorientable since there is a nonorientable surface in the 
connected sum out of which S is composed. Using Lemma 14.23 and the 
Euler characteristics for K, T , and P, we find that x(S) = — 8 . Among 
the compact surfaces listed in Theorem 14.20, the nonorientable surface with 
Euler characteristic —8 is 10P, the connected sum of 10 projective planes, and 
therefore S is homeomorphic to 10P. 


We would also like to consider a disk or a Mobius band as a surface, 
however, the points on their edges do not have neighborhoods homeomorphic 
to the open 2-ball. But they do have neighborhoods homeomorphic to half 
the ball. Thus, let H n be the subspace of the open n-ball B n given by H n = 
{(xi , X 2 , ■. . , x„) 6 B n | x n > 0}. We define a broader class of manifolds as 
follows. 


DEFINITION 14.26. An n-manifold M with boundary is a Hausdorjf space 
with a countable basis such that every point either has a neighborhood home- 

o 

omorphic to the open n-ball B n or has a neighborhood homeomorphic to H , 
and the set of points of the latter type is nonempty. 

The set of points that have a neighborhood homeomorphic to B n is called 
the interior of the manifold, and the set of points that have no neighborhood 
homeomorphic to B n , but have a neighborhood homeomorphic to H n , is called 
the boundary of the manifold. The boundary of M is denoted 3 M. 

A connected 2-manifold with boundary is called a surface with boundary. 


EXAMPLE 14.5. In Figure 14.23 we show a variety of surfaces with bound¬ 
ary. On the left, Si is a genus 3 surface from which the interior of two disks 
have been removed. In the middle, S 2 is a disk with two overlapping strips 
attached to its boundary. On the right, S 3 is a subspace of R 3 whose boundary 
(considering S 3 as a surface with boundary) is a trefoil knot. 



Si S 2 S 3 


FIGURE 14.23: Examples of surfaces with boundary. 
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In “surface with boundary” the meaning of “boundary” is distinct from 
its previous meaning, although the resulting sets could coincide. If a disk is 
considered as a subset of R 2 , then its boundary as a surface with boundary is 
the same as its boundary as a subset of R 2 . (See Figure 14.24.) However, in 
the case of the Mobius band embedded in R 3 , its boundary as a surface with 
boundary is a circle, whereas its boundary as a subset of R 3 is the entire Mobius 
band. 


o 

The boundary of D as a 
manifold with boundary 



The boundary of D 
as a subset of R 2 



The boundary of M as a 
manifold with boundary 



The boundary of M 
as a subset of R 3 




FIGURE 14.24: The boundary in a surface with boundary may or may not coincide 
with the boundary as a subset of a topological space. 


We will accept the following facts about surfaces with boundary S. They 
are plausible, but to prove them requires preliminary results that would take us 
too far afield to pursue. 

(i) The boundary of 5 is a 1-manifold. 

(ii) If 5 is compact, then its boundary has finitely many components. 

Much of what has already been addressed for compact surfaces (the exis¬ 
tence of triangulations, the definition of orientability, the Euler characteristic, 
and so on) carries over to compact surfaces with boundary. 

Let 5 be a compact surface with boundary. Each component of its boundary 
is a compact 1 -manifold, and therefore by Theorem 14.3, each such component 
must be homeomorphic to a circle. For each such boundary circle, we can add a 
disk with its boundary glued to the circle. This process is called capping off the 
boundary of S. Then, S together with these glued-on disks is a compact surface 
S* with no boundary. Therefore, every compact surface S with boundary 
is obtained by removing the interiors of finitely many disjoint disks from a 
compact surface S* among those listed in Theorem 14.20. 

If a surface with boundary S is embeddable in R 3 , it need not be the case 
that the corresponding capped-off surface S* is as well. For example, the 
Mobius band is a surface with boundary that is embeddable in R 3 , but when 
we cap off a Mobius band we obtain a projective plane, a surface that is not 
embeddable in R 3 . 
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In order to identify a compact surface with boundary S, it is enough to 
calculate its Euler characteristic,determine whether or not it is orientable, 
and determine how many boundary components it has. Then we consider the 
capped off surface S*. It is straightforward to see that the Euler characteristics 
of S and S* are related by x(S*) — X (S) + «, where n is the number of 
boundary components of S or, alternatively, the number of disks glued onto 
5 to obtain S*. Using the Euler characteristic of S* and whether or not S* is 
orientable, we can identify S* by Theorem 14.20. Then we have recognized S 
to be the resultant surface S* with the interiors of n disjoint disks removed. 


-1 

EXAMPLE 14.6. In Figure 14.25 we show a surface with boundary, S, that 
we consider as a subspace of M 3 . The boundary of S is connected and forms 
the knot denoted 74 . We would like to identify S. 



FIGURE 14.25- Which surface with boundary is this? 

Note that every loop in S must pass through an even number of half twists. 
Therefore, it is not possible to embed a Mobius band in 5, implying that S 
is orientable. Furthermore, the Euler characteristic of 5 is —1. (See Exer¬ 
cise 14.16.) Thus, if S* is the surface obtained by capping off S, then S* is 
orientable and x(S*) = /(5j 4- 1 = 0. It follows that S* is a torus, and S is 
the surface with boundary obtained by removing the interior of a single disk 
from a torus. 


Exercises for Section 14.2 


14.11. Prove Theorem 14.21: For every integer n > 3, 


nP = 


a=±T#P 

!L Y i T#K 


if n is odd, 
if n is even. 


14.12. Prove Theorem 14.24: For the compact surfaces nT and nP, the Euler char¬ 
acteristic is given by /(nT) =2 — 2 n and x(«P) = 2 — n. 

14.13. Consider the surface T#K#K#P#P#P. Which of the surfaces listed in Theo¬ 
rem 14.20 is this? Which of the surfaces listed in Theorem 14.22 is this? 

14.14. Consider all of the possible ways of creating a topological space from a square 
by gluing the edges in pairs. For each possibility, identify the space that results. 
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14.15. (a) Show that a hexagon with opposite edges identified straight across yields 

a torus. 

(b) Determine what surface is obtained when opposite edges are identified 
straight across on an octagon. 

(c) Determine what surface is obtained when opposite edges are identified 
straight across on a decagon. 

(d) Determine what surface is obtained when opposite edges are identified 
straight across on a 2n-gon, for n >6. 

14.16. For the surface with boundary S in Example 14.6, show that /(S) = —1. 

14.17. Identify each of the surfaces with boundary shown in Figure 14.26. 



FIGURE 14.26: Identify each of these surfaces with boundary. 

14.18. (a) Identify (without proof) all of the connected 1-manifolds with boundary, 
(b) Identify (without proof) all of the compact 1-manifolds with boundary. 


14.3 Three-Manifolds 

In this section, we look at a variety of examples of 3-manifolds, keeping in 
mind that any one might be the spatial universe in which we reside. 


EXAMPLE 14.7. The 3-sphere S 3 . The 3-sphere is defined as the set of 
points at a unit distance from the origin in 4-space, 

S 3 = {(*, y, z, w) | x 2 + y 2 + z 2 + w 2 = 1}. 

It is a compact connected 3-manifold. (See Exercise 14.19.) 

In Example 7.20, we discussed how S 3 can be regarded as the one-point 
compactification of 3-dimensional Euclidean space, R 3 . By adding just the 
single point to R 3 and appropriately defining a topology on the result, we 
obtain a space homeomorphic to S 3 . 

Here we consider another way to picture S 3 . We look at it in analogy 
to the 2-sphere, which can be thought of as an upper hemisphere glued to 
a lower hemisphere along an equator. Since the two hemispheres are each 
homeomorphic to the disk, we can think of the 2-sphere as coming from gluing 
the boundaries of two disks together. (See Figure 14.27.) 
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FIGURE 14.27: Gluing two disks together to obtain the 2-sphere. 

We can similarly construct the 3-sphere by gluing two 3-balls together 
along their spherical boundary. But where are these two 3-balls in our initial 
description of the 3-sphere as the set of points at a unit distance from the origin 
in 4-space? Let 

R = {(x, y, z, w) | x 2 + y 2 + z 2 + w 2 = 1 and w = 0). 

In other words, R is the subset of S 3 obtained by setting w = 0. Notice that 
for those points, x 2 + y 2 + z 2 = 1, so R is a 2-sphere. The two sets 

S 3 = {(x, y, z, w) | x 2 + y 2 + z 2 + w 2 = 1 and w < 0} and 

5+ = {(x, y, z, w) | x 2 + y 2 + z 2 + w 2 — 1 and w > 0 } 

each have boundary R. Therefore S 3 is obtained by gluing S 3 and S 3 together 
along their spherical boundaries, and each of these subspaces of S 3 is homeo- 
morphic to the 3-ball. (See Exercise 14.20.) Thus, in this way, we can see that 
S 3 results from gluing two 3-balls along their boundary. 


We would like to have a simple means for generating examples of 
3-manifolds. Just as surfaces can be triangulated, 3-manifolds can be de¬ 
composed into tetrahedra, any two of which can meet in a face, an edge, or a 
vertex, if at all. And just as we can glue the finite number of triangles in a tri¬ 
angulation of a compact connected surface together along a subset of the edge 
gluings to realize the surface as a polygon with pairs of edges to be identified, 
we can glue the finite number of tetrahedra in a triangulation of a compact, 
connected 3-manifold together along a subset of the face gluings to realize the 
3-manifold as a polyhedron with pairs of faces on its exterior to be identified. 

When discussing polyhedra, by the interior of a face we mean the set 
obtained by removing the perimeter edges and vertices from the face. Similarly, 
by interior of an edge, we mean the set obtained from the edge by removing 
the vertices on either end of it. These interiors correspond to the interior of 
the sets as manifolds with boundary, rather than the interior as subsets of the 
polyhedron. 


EXAMPLE 14.8. The 3-torus. In Chapter 3 we defined the 3-torus as the 
product space S 1 x S 1 x 5*. The circle S ' 1 has a countable basis and is compact, 
connected, and Hausdorff. It follows that the 3-torus, being a product of circles, 
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has these properties as well. Each point in S 1 has a neighborhood that is 
homeomorphic to an open interval. Therefore each point in the 3-torus has a 
neighborhood that is homeomorphic to a product of three open intervals, and 
such a neighborhood is homeomorphic to the open 3-ball. Hence, the 3-torus 
is a compact, connected 3-manifold. 

We can also obtain a 3-torus by gluing opposite faces straight across on 
a cube, as in Example 3.26. For similar constructions that we do hereafter, 
it is helpful to see in this representation how each point in the 3-torus has an 
open 3-ball neighborhood. A point in the interior of the cube has an open 
3-ball neighborhood, and therefore so does the corresponding point in the 
3-torus. (See Figure 14.28.) A point in the interior of a face has a half-ball 
neighborhood. That point is identified with a point in the opposite face that 
also has a half-ball neighborhood. These two half-ball neighborhoods glue 
together to form an open 3-ball neighborhood of the corresponding point in 
the 3-torus. A point in the interior of an edge has a quarter-ball neighborhood 
that glues together with the quarter-ball neighborhoods of three corresponding 
points in three other edges to form an open 3-ball neighborhood of the resulting 
point in the 3-torus. Finally, the eight points at the vertices have eighth-ball 
neighborhoods that also glue together to form an open 3-ball neighborhood 
about the resultant point in the 3-torus. Thus, every point in the 3-torus has an 
open 3-ball neighborhood. 



FIGURE 14.28: Each point in the 3-torus has an open 3-ball neighborhood. 


Based on this example, it seems relatively straightforward to generate com¬ 
pact, connected 3-manifolds. Just take your favorite polyhedron, with an even 
number of n-gon faces for each n, and glue the faces together in pairs so that 
edges glue to edges and vertices glue to vertices. We refer to this process of 
constructing a quotient space as a polyhedron gluing. 

We have to be careful, though, because the result of a polyhedron gluing 
might not be a 3-manifold. Given a polyhedron gluing, the points in the re¬ 
sulting quotient space that correspond to interior points of the polyhedron have 
open 3-ball neighborhoods, as needed. Also, the points that result from gluing 
points in the interior of the polyhedron’s faces have open 3-ball neighborhoods, 
obtained from gluing two half-ball neighborhoods, as in the case of the 3-torus. 
We are not guaranteed however, that the points that arise from gluing the edges 
of the polyhedron, or from gluing the vertices of the polyhedron, have open 
3-ball neighborhoods. In order to obtain a 3-manifold from a polyhedron glu¬ 
ing, there are additional conditions on the edge gluing and the vertex gluing 
that need to be satisfied. We discuss these conditions next. 
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To begin, we consider the edges. Given a polyhedron gluing, an equiva¬ 
lence relation is naturally defined on the edges such that two edges are equiv¬ 
alent if they are glued to each other. In each equivalence class of edges, all of 
the edges are glued together. Suppose we have an equivalence class of edges, 
[e \,... , e p j. Let e* denote the set in the quotient space that results from glu¬ 
ing together the interiors of these edges. We assume that the midpoints of the 
edges are glued together, and we refer to the resulting quotient-space point as 
the midpoint of e*. 

Each point in the interior of an edge in the polyhedron has a neighborhood 
that resembles a section of an orange, as illustrated in Figure 14.29. Under 
the polyhedron gluing, orange-section neighborhoods are glued together to 
form quotient-space neighborhoods of each point in e*. We are interested in 
whether or not these quotient-space neighborhoods are homeomorphic to the 
open 3-ball. 



FIGURE 14.29: Points in the interior of each edge in the polyhedron have orange- 
section neighborhoods. 

Assuming that the edges in our equivalence class are ordered properly, we 
can form a face-edge sequence 

F[U], e\, F[ i, 2 ], F[ 2 , 1 ], ei, ^[2,2], • ■ • . F[p, 1]> e p , F[ p< z\, 

such that edge e, occurs on faces F[,- ( j] and F[,-, 2 ] for each i = 1 ,... , p, face 

F[i,2] is glued to face F[,+i,i] for each/ = 1. p — 1, and face F[ p . 2 ] is glued 

to face F[i,i]. For each edge in our equivalence class, take an orange-section 
neighborhood that contains the entire interior of the edge. We can choose 
these neighborhoods so that they glue to each other to form a neighborhood 
of e* in the quotient space. The face-edge sequence gives an ordering of 
the glued orange-section neighborhoods around e*, and, since the last face in 
the sequence is glued to the first face, the orange-section neighborhoods glue 
together all of the way around e*. 

Now, if we put an arrow on ei to represent a direction, the other edges 
then inherit directions from e\ as we glue the faces together in sequence. 
After we include e p , all edges have a direction. The final face gluing then 
pairs e p to e\ so that either the arrows agree or they do not. In the former 
case, the orange-section neighborhoods glue together to form an open 3-ball 
neighborhood of each point on e*. In the latter case, there are arbitrarily small 
neighborhoods of the midpoint of e* that have projective-plane boundaries. 
(See Exercise 14.21.) While we do not prove it here, it can be shown that in 
the latter case no neighborhood of the midpoint of e* is homeomorphic to the 
open 3-ball. 
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If the directions on e p and e\ agree, we say that the face-edge sequence 
is direction preserving; otherwise we say that it is direction reversing. It is 
straightforward to show that for an equivalence class of edges, if one face-edge 
sequence is direction preserving, then every face-edge sequence for the equiv¬ 
alence class is direction preserving (and therefore if one face-edge sequence 
for the equivalence class is direction reversing, then every face-edge sequence 
for the equivalence class is direction reversing). This leads to the following 
test: 

The Edge Test: Given a quotient space obtained by a polyhedron gluing, 
for each equivalence class of edges, form a face-edge sequence and determine 
whether or not it is direction preserving. If each equivalence class of edges has 
a direction preserving face-edge sequence, then the quotient space passes the 
Edge Test; otherwise, the quotient space is not a 3-manifold. 

For example, consider the polyhedron gluing for the 3-torus in Exam¬ 
ple 14.8. There are three equivalence classes of edges, and it is straightforward 
to see that each has a direction preserving face-edge sequence. Therefore this 
quotient space passes the Edge Test. Note that this is consistent with the ob¬ 
servation in Example 14.8 that each point resulting from gluing the interiors of 
edges has an open 3-ball neighborhood obtained by gluing together quarter-ball 
neighborhoods of points in the edges of the cube. 


EXAMPLE 14.9. Consider the gluing on the cube shown on the left in Fig¬ 
ure 14.30. On the right in the figure, the edges numbered 1-4 form an equiv¬ 
alence class for this gluing. If we let T, B, F, B', L, and R, represent the 
top, bottom, front, back, left, and right faces of the cube, respectively, then 
T, 1, F, B',2,L, R,3, B', F, 4, B is a face-edge sequence for the equivalence 
class. If we choose a direction for edge 1 as shown, then edges 2-4 inherit 
the depicted directions. The bottom face glues to the top face with a flip, and 
therefore the direction on edge 4 does not match the direction on edge 1 when 
those faces are glued. Thus, this quotient space fails the Edge Test and is not 
a 3-manifold. 




FIGURE 14.30: The equivalence class of edges {1,2,3,4} has a direction reversing 
face-edge sequence. 
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Next, let us consider the vertices in a polyhedron gluing. On the set of 
vertices, we define an equivalence relation such that two vertices are equivalent 
if they are glued together. For each equivalence class, all of the corresponding 
vertices are identified under the polyhedron gluing, resulting in a single point 
in the quotient space. 

Suppose we have an equivalence class of vertices, {uj,... , v n \. Let v* 
denote the point in the quotient space that results from gluing together these 
vertices. When we cut through the polyhedron near a vertex u,-, as illustrated 
in Figure 14.31, each v,- is the apex of a pyramid that has faces lying in faces 
of the polyhedron and has a polygon base whose interior is in the interior of 
the polyhedron. (By pyramid, we mean a polyhedron formed by taking all of 
the line segments that connect an apex point to a base that is a polygon, not 
necessarily a square.) 



FIGURE 14.31: Each vertex of the polyhedron is the apex of a pyramid obtained by 
cutting the polyhedron near the vertex. 

Now, take a collection of these pyramids, one for each u,-, such that when 
the polyhedron is glued together, the faces, edges, and vertices of the pyramids 
properly line up and are also glued together. Let B denote the subset of the 
quotient space resulting from gluing together the pyramids. The point v* lies 
in the interior of B in the quotient space. We are interested in whether or not 
the interior of B is homeomorphic to the open 3-ball. 

As a result of the way that the set B is constructed, the boundary of B is 
made up of the glued-together pyramid bases and consequently is a compact 
surface. We call this surface a surrounding surface for v*. If this surface is 
a sphere, then as we shrink the pyramids toward the vertices u,-, the resulting 
surrounding surfaces essentially yield a set of concentric spheres closing in 
on v*. It follows that B is homeomorphic to the 3-ball, and therefore v* has 
an open 3-ball neighborhood. If the surrounding surface is not a sphere, it 
can be shown (but we do not present a proof) that no neighborhood of v* is 
homeomorphic to the open 3-ball. This leads to the following test: 

The Vertex Test: Given a quotient space obtained by a polyhedron gluing, 
if each equivalence class of vertices has a surrounding surface that is a sphere, 
then the quotient space passes the Vertex Test; otherwise, the quotient space is 
not a 3-manifold. 

For example, consider the polyhedron gluing for the 3-torus in Exam¬ 
ple 14.8. There is one equivalence class of vertices, consisting of all eight 
vertices of the cube. Let v* be the point in the quotient space resulting from 
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the glued-together vertices, and let S be a surrounding surface. Note that S is 
naturally triangulated by the pyramid bases since each base is a triangle. 

We compute the Euler characteristic of S from this triangulation. Each 
vertex in the equivalence class contributes one pyramid base in forming S and 
therefore contributes one triangle to the triangulation. Thus, F = 8 for our 
Euler characteristic computation. There are edges in the triangulation 
of the surrounding surface since each pyramid base contributes three edges, 
but these edges are glued together in pairs in forming S. Hence, E = 12. 
Finally, prior to our gluing the polyhedron, each of the cube’s edges contains 
two vertices in the triangles that glue together to form S. Once the gluing is 
done, we have two triangulation vertices on each of the sets e* that result from 
gluing together the cube’s edges; that is, each equivalence class of cube edges 
contributes two vertices to the triangulation. We observed previously that there 
are three equivalence classes of edges; therefore V = 6. Consequently, the 
Euler characteristic of S is 2. 

From our classification results for compact surfaces in the previous section, 
it follows that S is a sphere. Therefore, this quotient space passes the Vertex 
Test. Of course, this is consistent with the observation made in Example 14.8 
that v* has an open 3-ball neighborhood resulting from gluing together eighth- 
ball neighborhoods of each of the cube’s vertices. 


EXAMPLE 14.10. Consider the gluing on the octahedron shown on the left in 
Figure 14.32. The top vertex of the octahedron is glued to no other vertex and 
therefore is the only point in its equivalence class. Let v* be the corresponding 
point in the quotient space. If we cut through the octahedron near the top 
vertex, we obtain a pyramid as illustrated in the middle of the figure. The front 
and back faces and the left and right faces of the pyramid are glued together, 
as in the octahedron gluing. The base of the pyramid is glued together on its 
edges to form a surrounding surface for v*. As illustrated in the figure, this 
gluing amounts to identifying the opposite edges of a square in the usual way 
to obtain a torus. Thus, this quotient space fails the Vertex Test and is not a 
3-manifold. 



FIGURE 14.32: This octahedron gluing results in a point with a torus for a surround¬ 
ing surface. 
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Combining our manifold-construction tests, we have the following theorem 
on polyhedron gluing: 

THEOREM 14.27. Let M be a quotient space resulting from a polyhedron 
gluing. Then M is a 3-manifold if and only if it satisfies both the Edge Test and 
the Vertex Test. 

Proof. As already discussed, if the quotient space does not satisfy either 
the Edge Test or the Vertex Test, then M is not a 3-manifold. 

Thus suppose that the quotient space M satisfies both the Edge Test 
and the Vertex Test. That M is Hausdorff and has a countable basis fol¬ 
lows in a manner similar to the case for the Klein bottle as a quotient 
space of a square, presented in Theorem 14.5. Small, open 3-ball neigh¬ 
borhoods, glued half-ball neighborhoods, glued orange-section neigh¬ 
borhoods, and glued neighborhoods of vertices can be used to define a 
countable basis and to separate points to show that M is Hausdorff. 

As previously discussed, the points in M that correspond to interior 
points of the polyhedron and the points in M that correspond to the interior 
of a face have open 3-ball neighborhoods. Since M satisfies the Edge 
Test, the points in M that result from gluing together the interiors of edges 
have open 3-ball neighborhoods obtained from gluing orange-section 
neighborhoods together. And, since M satisfies the Vertex Test, the points 
in M that result from gluing together vertices have 3-ball neighborhoods 
that come from taking the interior of glued-together pyramids. 

Therefore, if M satisfies the Edge Test and the Vertex Test, then M 
is a 3-manifold. ■ 

Let us consider some additional examples of 3-manifolds. 


EXAMPLE 14.11. The quarter-twist manifold. This manifold is obtained 
from a cube by gluing two pairs of opposite faces straight across and by gluing 
the last pair by a one-quarter twist, as shown in Figure 14.33. There are three 
equivalence classes of edges, each containing four edges, as occurs for the 
3-torus. Each of these., equivalence classes has a direction preserving face- 
edge sequence. (See Exercise 14.22.) 



FIGURE 14.33: Gluing the faces of a cube to obtain the quarter-twist manifold. 
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All eight vertices in the cube are identified with each other, and, as in the 
3-torus example, the Euler characteristic can be used to show that a surrounding 
surface of the corresponding point in the quotient space is a sphere. It follows 
that the quarter-twist manifold is a 3-manifold. 


EXAMPLE 14.12. The sixth-twist manifold. This quotient space is defined 
on a hexagonal prism, as shown in Figure 14.34. Each square face is glued 
directly to the square face opposite it, and the front and back hexagonal faces 
are glued together with a one-sixth twist. The quotient space satisfies the Edge 
Test and the Vertex Test and therefore is a 3-manifold. (See Exercise 14.23.) 



FIGURE 14.34: Gluing the faces of a hexagonal prism to obtain the sixth-twist 
manifold. 

_i 


EXAMPLE 14.13. The Poincare dodecahedral space. We identify each of 
the opposite pairs of faces on a dodecahedron by a one-tenth clockwise rotation, 
as in Figure 14.35. Note that a one-tenth clockwise twist identification from 
front to back is the same as a one-tenth clockwise twist identification from back 
to front. The resulting quotient space satisfies the Edge Test and the Vertex 



FIGURE 14.35: Gluing the faces of a dodecahedron to obtain the Poincare dodec¬ 
ahedral space. 
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Test and therefore is a 3-manifold. (See Exercise 14.24.) The 30 edges are 
divided up into ten equivalence classes, each containing three edges. This will 
be relevant in the next section. The vertices are divided into five equivalence 
classes, each containing four vertices. 


EXAMPLE 14.14. The Seifert-Weber dodecahedral space. In this case, 
we identify opposite faces on a dodecahedron by a three-tenths clockwise twist. 
(See Figure 14.36.) Here the 30 edges of the dodecahedron are divided into 
six equivalence classes of five edges each. All of the 20 vertices are identified 
with each other, and therefore there is just a single equivalence class of vertices. 
The quotient space is a 3-manifold. (See Exercise 14.25.) 



FIGURE 14.36: Gluing the faces of a dodecahedron to obtain the Seifert-Weber 
dodecahedral space. 


j 


As with 2-manifolds, we have a notion of orientability for 3-manifolds. 
Instead of nonorientability corresponding^ to the presence of a Mobius band, 
as in the case of a nonorientable 2-manifold, a 3-manifold is nonorientable if 
it contains a subspace known as a solid Klein bottle. 

DEFINITION 14.28. A solid Klein bottle is the quotient space obtained by 
identifying the opposite disks at the ends of a solid cylinder by reflection, as in 
Figure 14.37. 



FIGURE 14.37: A solid Klein bottle. 


Notice that each concentric cylinder inside the solid cylinder becomes 
a Klein bottle after the gluing. So a solid Klein bottle consists of a set of 
concentric Klein bottles shrinking down to a core circle. 
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What would happen if we were inside a solid Klein bottle and we walked 
out one end, through the gluing, and came back to where we started? 

Because of the way the gluing flips everything left to right, we would return 
with our hearts on the other side of our bodies. The faces of our watches would 
be reversed, and the hands on our watches would now move counterclockwise. 
Interestingly, we would not be able to tell the difference. To us, it would appear 
that our hearts were still on the left, and that our watches ran as they always 
did. But everyone that we left behind would appear reversed to us, as would 
we to them. 

Could the universe contain such a solid Klein bottle? From a mathematical 
perspective, that is possible. 

DEFINITION 14.29. A 3-manifold is nonorientable if it contains a solid 
Klein bottle. If it does not, then the 3-manifold is orientable. 


EXAMPLE 14.15. Consider the quotient space obtained from a cube by iden¬ 
tifying opposite faces as in Figure 14.38. We previously saw this space in 
Example 3.27. The front and back faces are identified straight across. The top 
and bottom faces are identified with a flip. The same is true for the left and 
right faces. The result of the gluing is a 3-manifold and is homeomorphic to 
.S' 1 x P, the product of a circle and a projective plane. (See Exercise 3.38.) 



FIGURE 14.38: Identifying opposite faces of a cube to obtain a space homeomorphic 
to S 1 x P. 

This is a nonorientable manifold since it contains a solid Klein bottle, 
obtained by gluing the ends of a solid cylinder that runs from the center of the 
top face of the cube to the center of the bottom face. 


One of the great, open questions in the field of 3-manifold theory was the 
Poincare Conjecture. First posed by Henri Poincare in 1904, it asked whether a 
3-manifold that behaved enough like the 3-sphere must in fact be the 3-sphere. 

In what sense must the manifold behave like the 3-sphere? Notice that if 
you take a loop in the 3-sphere (a continuous function / : S 1 —> S 3 ), then it 
can be shrunk down to a point. Since the 3-sphere is path connected and every 
loop in the 3-sphere can be shrunk to a point, the 3-sphere is said to be simply 
connected. (See Section 9.5.) In contrast, this is not the case with the 3-torus. 
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Under the polyhedron gluing that yields the 3-torus, a vertical line segment in 
the cube becomes a loop that cannot be shrunk to a point in the 3-torus. 

The Poincare Conjecture states that if a 3-manifold without boundary is 
compact and simply connected, then it must be homeomorphic to the 3-sphere. 
Poincare attempted to prove this and failed. Many other attempts were made 
over the last century, and there were a number of announcements of proofs 
that were subsequently determined to be incorrect. In 2002, a Russian math¬ 
ematician, Grigori Perelman, announced a proof that was judged correct after 
several years of scrutiny by mathematicians around the world. Not only did 
Perelman solve this celebrated, century-old problem, but his techniques have 
applications beyond the proof of the Poincare Conjecture and promise to revolu¬ 
tionize research in three-dimensional topology. In 2006 Perelman was selected 
to receive the Fields medal for his work (but he declined the award). 

Exercises for Section 14.3 

14.19. Prove that the 3-sphere, S 3 , is a 3-manifold. 

14.20. Consider the spaces S+ and Sl_ from Example 14.7. Define an explicit home- 
omorphism between each of these spaces and the 3-ball. 

14.21. Suppose we have a polyhedron gluing and [e\ ,... , e p ] is an equivalence class 
of edges that has a direction reversing face-edge sequence. Let p be the point in 
the quotient space corresponding to the glued-together midpoints of the edges 
e;. Show that in the quotient space there are arbitrarily small neighborhoods of 
p whose boundary is a projective plane. 

14.22. (a) Show that the gluing in constructing the quarter-twist manifold in Exam¬ 

ple 14.11 satisfies the Edge Test and the Vertex Test. 

(b) Replace the quarter-twist gluing of the front and back faces of the cube 
by a half-twist gluing. Show that the resulting gluing satisfies the Edge 
Test and the Vertex Test. (The resulting manifold is called the half-twist 
manifold.) 

14.23. (a) Consider the sixth-twist manifold construction in Example 14.12. On the 

hexagonal prism, label the edge classes and vertex classes according to 
the prescribed gluing. Show that the gluing satisfies the Edge Test and the 
Vertex Test. 

(b) Replace the one-sixth twist for gluing the front and back faces of the 
hexagonal prism with a one-third twist. Repeat part (a) for this prescribed 
gluing of the faces of the hexagonal prism. (The resulting manifold is 
called the third-twist manifold.) 

14.24. Consider the gluing for the Poincare dodecahedral space in Example 14.13. 

(a) Show that the gluing results in 10 equivalence classes of edges, each con¬ 
taining three edges, and five equivalence classes of vertices, each contain¬ 
ing four vertices. 

(b) Show that the prescribed gluing satisfies the Edge Test and the Vertex Test. 

14.25. Consider the gluing for the Seifert-Weber dodecahedral space in Example 14.14. 

(a) Show that the gluing results in six equivalence classes of edges, each 
containing five edges, and one equivalence class of vertices containing all 
the vertices. 

(b) Show that the prescribed gluing satisfies the Edge Test and the Vertex Test. 
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14.26. The quaternionic manifold is defined by identifying each pair of opposite faces 
in a cube with a one-quarter clockwise turn. 

(a) On a diagram of a cube, label the faces to depict the gluing for this space. 

(b) Show that the prescribed gluing satisfies the Edge Test and the Vertex Test. 

14.27. Consider the gluing of faces of the cube that yields a space homeomorphic to 
S 1 x P,as described in Example 14.15. On the cube, label the edge classes and 
vertex classes according to the prescribed gluing. Show that the gluing satisfies 
the Edge Test and the Vertex Test. 

14.28. (a) Represent the product of a circle and a Klein bottle, S 1 x K, as a quotient 

space obtained by gluing faces of a cube. 

(b) On the cube, label the edge classes and vertex classes according to the 
prescribed gluing. Show that the gluing satisfies the Edge Test and the 
Vertex Test. 

(c) Show that S 1 x K is nonorientable. 

14.29. Glue the faces of a tetrahedron as shown in Figure 14.39. Is the result a 
3-manifold? 



FIGURE 14.39: Does a 3-manifold result from this polyhedron gluing? 

14.30. Consider the octahedron gluings described in (a) and (b) and illustrated in 
Figure 14.40. In each case determine whether or not the gluing results in a 
3-manifold. 

(a) The faces are glued together in pairs across the top and across the bottom, 
in each case with a one-third clockwise twist. 

(b) The faces are glued together in opposite pairs through the center of the 
octahedron, in each case with a one-sixth counterclockwise twist. 




FIGURE 14.40: Does a 3-manifold result from either polyhedron gluing? 
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14.4 The Geometry of the Universe 

Let us return to considering the spatial universe in which we live. As we noted, 
it seems to have the property that around every point there is a neighborhood 
that is homeomorphic to an open ball in JR 3 . We assume that the spatial universe 
is Hausdorff and has a countable basis for its topology and therefore is a 
3-manifold. But which 3-manifold is it? 

We can use geometry to limit the possibilities. We will not be formal about 
what it means to have a geometry on a manifold, but it suffices to think of it 
as meaning that there is additional structure that allows the measurement of 
quantities such as distance, angles, and curvature. We are all familiar with 
Euclidean geometry on the line, the plane, and 3-space. As geometries, these 
spaces are denoted by E, E 2 , and £ 3 , respectively. 

The geometry of the universe seems to be isotropic and homogeneous. To 
be isotropic means that at each point the geometry appears to be the same in 
all directions around the point. There is no preferred identifiable direction. 
This is a property that Euclidean 3-space £ 3 has, for instance. However, if we 
form a geometry by taking S 2 x E, for example, that geometry has different 
behavior in different directions. 

To be homogeneous means that locally the geometry of the space is the 
same. Given any two points in the space, there is an isometry (a distance¬ 
preserving homeomorphism) from a neighborhood of one point to a neigh¬ 
borhood of the other. For the purpose of understanding the overall, global 
geometry of the inverse, we ignore the problems that may occur around black 
holes; they have an isolated effect that we assume does not disturb the global 
geometry. 

There are only three three-dimensional geometries that are both homo¬ 
geneous and isotropic: the Euclidean, spherical, and hyperbolic geometries. 
Just as R 3 is the model space for 3-manifolds, Euclidean 3-space E 3 , the 
3-sphere S 3 , and a space known as hyperbolic 3-space are the models for ge¬ 
ometric 3-manifolds. By a Euclidean 3-manifold, we mean a 3-manifold on 
which there is defined a geometry that locally behaves like the geometry in 
Euclidean 3-space. We similarly define a spherical 3-manifold and a hyper¬ 
bolic 3-manifold. We expect the universe to be one of these three types of 
3-manifolds, with a geometry that locally behaves just like the geometry in the 
corresponding model space. 

Each of the model three-dimensional geometries has a curvature associated 
to it. In the case of Euclidean geometry, the curvature is 0. In the case of 
spherical geometry, the curvature is positive, and in the case of hyperbolic 
geometry, the curvature is negative. 

Looking at analogs of these three geometries one dimension down, we 
have a plane with zero curvature, a sphere with positive curvature, and a saddle 
with negative curvature, as illustrated in Figure 14.41. 

On a plane, which has zero curvature, the angles of a triangle add up to 
exactly 180 degrees. On a sphere, which has positive curvature, the angles of a 
triangle add up to more than 180 degrees. And, on a saddle, which has negative 
curvature, the angles of a triangle add up to strictly less than 180 degrees. 
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curvature. 

Suppose we have a 3-manifold M obtained by gluing the faces of a poly¬ 
hedron in Euclidean space. In order for M to be a Euclidean 3-manifold, there 
are some conditions that must be satisfied. First, the gluing between the faces 
must preserve distance on the faces, so that the local geometry is Euclidean 
geometry at each point that results from gluing together two faces. Then, given 
that the faces are glued together preserving distance, when we glue orange- 
section neighborhoods together to obtain a neighborhood of a point in A/, the 
angles between the orange-section faces that glue together must add up to 360 
degrees. In other words, it is necessary that each point in M that results from 
gluing together edges has a full 360 degrees of Euclidean angle around it. Fi¬ 
nally, it must also be the case that the manifold inherits the correct geometry 
around the glued vertices. Small neighborhoods of polyhedron vertices that 
glue to each other must fit together so that the local geometry is Euclidean 
geometry in a neighborhood of the resultant point in the manifold. In all of the 
following examples, once the edge conditions are satisfied, the glued-together 
vertices inherit the required neighborhoods. 

We can also glue together the analogs of polyhedra in spherical space, and 
we can do the same in hyperbolic space. In both settings, in order for the 
result to be a 3-manifold with the corresponding geometry, we need to satisfy 
conditions like those just discussed for Euclidean 3-manifolds. 


EXAMPLE 14.16. Let M be a 3-manifold that results from gluing the faces 
of a cube in Euclidean space. At each edge in a cube, two faces meet with a 90 
degree angle between them. In order for M to be a Euclidean 3-manifold, it is 
necessary to have four edges in each equivalence class of edges, so that we then 
have the requisite 360 degrees in neighborhoods of points in M that result from 
gluing together edges in the cube. This occurs for the 3-torus, the quarter-twist 
manifold, and the half-twist manifold introduced in Exercise 14.22. Therefore 
each of these is a Euclidean 3-manifold. 


EXAMPLE 14.17. Consider the sixth-twist manifold from Example 14.12. 
It is formed by gluing the faces of a hexagonal prism in Euclidean 3-space. 
Let e be an edge at the intersection of two square faces in the prism. The angle 
between the faces is 120 degrees, and e glues together with two other edges 
that are each at the intersection of two square faces. Therefore, in the manifold 
each point that results from gluing these edges has a total angle around it of 
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360 degrees. Now, let e be an angle at the intersection of a hexagonal face 
and a square face. In this case, the angle between the faces is 90 degrees, and 
e glues together with three other edges that are each at the intersection of a 
hexagonal face and a square face. Here too, in the manifold each point that 
results from gluing these edges has a total angle around it of 360 degrees. It 
follows that the sixth-twist manifold is a Euclidean 3-manifold. 


EXAMPLE 14.18. The angles between the faces of a Euclidean dodecahe¬ 
dron measure cos -1 (—i-s/5) ~ 116.6 degrees. For the gluing on the dodeca¬ 
hedron that yields the Poincare dodecahedral space, we indicated that the edges 
are grouped into 10 equivalence classes of three edges each. This means that 
around the corresponding points in the manifold the total angle is shy of the 
needed 360 degrees. The resulting manifold is not a Euclidean 3-manifold. 

To resolve this situation, we realize the dodecahedron in spherical geom¬ 
etry. Placing a dodecahedron in spherical geometry increases the angles at 
its edges. In fact, for any fixed value of 9 between cos -1 (— j\/5) and 180 
degrees, there is a dodecahedron in the 3-sphere having 6 as the angle between 
each pair of intersecting faces. In particular, a regular dodecahedron exists in 
the 3-sphere with angles of 120 degrees between the faces at each of the edges. 
We can glue together such a dodecahedron to obtain the Poincare dodecahedral 
space, and in so doing we satisfy the requirement of having 360 degrees around 
each point in each glued-together edge. Therefore, the Poincare dodecahedral 
space is a spherical 3-manifold. 

For the gluing on the dodecahedron that yields the Seifert-Weber dodeca¬ 
hedral space, the edges are grouped into six equivalence classes of five edges 
each. Hence, we need to have an angle of 72 degrees between the dodeca¬ 
hedron faces at each edge. Here too, we cannot realize the resulting mani¬ 
fold as a Euclidean 3-manifold. However, given any angle 9 between 60 and 
cos -1 (—5 V5), in hyperbolic space we can construct a dodecahedron such that 
9 is the angle between each pair of intersecting faces. In particular, a regular do¬ 
decahedron with angles of 72 degrees between faces exists in hyperbolic space. 
Hence, the Seifert-Weber dodecahedral space is a hyperbolic 3-manifold. 


Data from the Wilkinson Microwave Anisotropy Probe, launched by the 
National Aeronautics and Space Administration (NASA) in 2001, has provided 
cosmologists with some evidence that the curvature of the universe is very close 
to 0. This either means that we live in a Euclidean universe, or means that we 
live in a spherical or hyperbolic universe with low curvature. If we live in a 
Euclidean universe with curvature 0, the geometry that we experience in the 
universe, even on the largest scale, is Euclidean geometry, the one we have 
grown up studying. So a giant triangle with vertices formed by distant galaxies 
would have the property that the sum of its angles is 180 degrees. If we assume 
that the universe is Euclidean, does this help us determine which manifold is 
the universe? Which 3-manifolds have a Euclidean geometry? Remarkably, 
there are only 18 possibilities, a result first proved in 1934 by mathematical 
crystallographer Werner Nowacki (1909-1988) in [Now]. 
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THEOREM 14.30. There are exactly eighteen Euclidean 3-manifolds, six 
compact and orientable, four compact and nonorientable, four noncompact 
and orientable, and four noncompact and nonorientable. 

This theorem is an amazing result, telling us that the possibilities for Eu¬ 
clidean 3-manifolds are extremely limited. In contrast, there are infinitely 
many spherical 3-manifolds and infinitely many hyperbolic 3-manifolds. 

If we assume further that the universe is compact, the number of pos¬ 
sibilities drops to 10. Four of the 10 are nonorientable. As we indicated 
in the previous section, there are some unusual consequences of traveling 
through a solid Klein bottle in a nonorientable 3-manifold. As fascinating 
as such a possibility is, there are physics aspects that make it highly unlikely. 
Thus, what remains for consideration are six compact and orientable Euclidean 
3-manifolds. 

Of these six manifolds, we have already seen the 3-torus, the quarter- 
twist manifold, the sixth-twist manifold, the half-twist manifold, and the third- 
twist manifold, in the examples and exercises in Section 14.3. The remaining 
possibility, the Handtche-Wendt manifold, is introduced in Exercise 14.32. 

Exercises for Section 14.4 

14.31. Show that the third-twist manifold of Exercise 14.23 can be realized as a Eu¬ 
clidean manifold. 

14.32. Define the Handtche-Wendt manifold by gluing together the faces on a pair of 
cubes as shown in Figure 14.42. 

(a) Show that the prescribed gluing satisfies the Edge Test and the Vertex Test 
and therefore the result is a 3-manifold. 

(b) Show that the Handtche-Wendt manifold is a Euclidean 3-manifold. 



FIGURE 14.42: Gluing faces on a pair of cubes to obtain the Handtche-Wendt 
manifold. 

14.33. The quatemionic manifold, introduced in Exercise 14.26, has one of Euclidean, 
spherical, or hyperbolic geometry. Which one is it? 

14.34. Show how each of the first five compact, orientable Euclidean 3-manifolds is 
a so-called torus bundle. That is to say, there exists a torus embedded in each 
manifold such that when we remove a neighborhood of that torus, the result is 
homeomorphic to T x [0,1], the product of a torus and an interval. 
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14.35. A parallelepiped is defined by any three linearly independent vectors in 3-space. 
For each of the three compact, orientable Euclidean 3-manifolds that can be 
obtained by identifying opposite faces of the cube, determine what restrictions 
on the shape of a parallelepiped are necessary for the manifold to be realized 
by identifying opposite faces of the parallelepiped. 

14.36. Find examples of three different noncompact and connected Euclidean 
3-manifolds. 

14.37. (a) Show that K x S 1 , the product of a Klein bottle and a circle, can be realized 

as a Euclidean 3-manifold. 

(b) Give an example of another compact, connected and nonorientable Eu¬ 
clidean 3-manifold. 


14.5 Determining which Manifold is the Universe 

Now that we have a list of some candidate manifolds for the spatial universe, 
how do we tell which is the right one? In this section, we discuss two methods 
that cosmologists are using to determine the shape of the universe. 

Cosmic Crystallography 

French cosmologists Marc Lachieze-Rey, Roland Lehoucq, and Jean-Pierre 
Luminet proposed this method in their 1996 paper [Leh]. Imagine we live in a 
3-torus, formed from gluing together the faces of a cube that has an edge length 
of 10 million light-years. Further, for now, imagine that we can see everything 
that is in the universe at this instant (we will adjust this physically impossible 
assumption shortly). When we gaze out in each of the directions perpendicular 
to the cube’s faces, as illustrated in Figure 14.43, our galaxy is in our line of 
sight, 10 million light-years away. Extending our line of sight further, we see 
our galaxy again 20 million light-years away, and then 30 million light-years 
away, and so on. Therefore, our galaxy forms a lattice pattern as we look out 
into the universe in all directions. Not only that, so does every galaxy we see 
within the glued-together cube. While galaxies may be scattered randomly 
within the cube, if we consider every galaxy image that we see as we gaze out, 
there are lots of pairs of images that are 10 million light-years apart, and there 
are lots of pairs of images that are 20 million light-years apart, and so forth. 



FIGURE 14.43: In a 3-torus universe, our line of sight brings us back to our galaxy. 
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In this 3-torus universe, it is not only at integer multiples of 10 million light- 
years that we see our galaxy. If we look in a direction parallel to a diagonal in 
one of the cube’s faces, then we see our galaxy %/2 x 10 million light-years 
away and at integer multiples of this distance. Also, if we look in a direction 
parallel to one of the cube’s diagonals, then we see our galaxy a/3 x 10 million 
light-years away and at integer multiples of this distance as well. There are 
other distances, too, at which we see our galaxy, depending on our viewing 
direction. 

By this reasoning, if we live in this 3-torus universe, and we measure 
the distance between every pair of galaxies that we see, then in the overall 
distribution of distances, we expect to have spikes as illustrated on the left 
in Figure 14.44. In contrast, if our universe is Euclidean 3-space, we might 
expect to see a somewhat random distribution of distances, centered around 
some average distance, as shown on the right in Figure 14.44. 



FIGURE 14.44: Hypothetical distributions of distances between pairs of galaxies in 
the 3-torus and in 3-space. 


In theory, this sounds great; however, we have to take into account the 
physical reality. It takes time for the light from galaxies to reach us, and 
galaxies are in motion, so two different images of a particular galaxy will not 
necessarily have a distance between them exactly equal to one of the previously- 
mentioned theoretical values (10 million light-years, a /2 x 10 million light- 
years, a/3 x 10 million light-years, and so on). But still, we can expect to see 
distances clustered around these values. Furthermore, galaxies are hard to spot 
when they are a great distance away, so they are not the best structures to be 
observing. 

An alternative that cosmologists are using is measuring the distances be¬ 
tween superclusters of galaxies. The hope is to use catalogs of superclusters 
to identify the distances between pairs of superclusters, and then to use pat¬ 
terns in the distance data to try to identify the manifold corresponding to the 
shape of the universe. With current supercluster catalogs, no patterns have yet 
been discovered. But in the near future, more complete catalogs will become 
available, which may provide patterns that help reveal the manifold in which 
we live. 
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The Cosmic Microwave Background Radiation 

In 1965, Amo Penzias and Robert Wilson of Bell Lab found an unexpected, 
constant background radio noise when studying radio emissions from the Milky 
Way. The scientists eventually realized that the noise they were receiving was 
essentially constant wavelength radiation coming from outside the galaxy in ev¬ 
ery direction in the sky. This radiation is now known as the cosmic microwave 
background (CMB). Their discovery turned out to have very important impli¬ 
cations, as this radiation proved to be the oldest light in the universe. The light 
of the CMB marks the first time after the big bang that the universe had cooled 
enough for radiation to escape from the hot cosmic soup. About 300,000 years 
after the big bang, the temperature of the universe was sufficiently cool to allow 
electrons and protons to combine to form the first atoms. This also meant that 
radiation, which had previously been constantly scattered by the free charged 
particles, was now able to travel at the speed of light through the still expand¬ 
ing universe. This escaped radiation is the cosmic microwave background that 
we now observe as a relatively constant temperature (about 2.73 Kelvin) radio 
source in every direction. The temperature of the CMB varies to one part in 10 
thousand. The small variations that exist in the CMB show us the slight density 
differences present in the early universe that eventually led to the clumping of 
matter into galaxies, stars, and planets through the force of gravity. 

The CMB radiation appears to emanate from a large sphere, called the 
surface of last scattering. Data from NASA’s Cosmic Background Explorer 
(COBE) satellite, launched in 1989, enabled the construction of the first tem¬ 
perature map of the CMB. NASA’s Wilkinson Microwave Anisotropy Probe 
(WMAP), launched in 2001, has provided CMB temperature data of signifi¬ 
cantly higher resolution than COBE. In Figure 14.45, we show a temperature 
map of the CMB derived from WMAP data. The darker shaded areas are 
cooler and the lighter shaded areas are warmer, but, as already indicated, the 
temperature difference on either side of the 2.73 Kelvin average is only about 
0.000IK. 



FIGURE 14.45- A temperature map of the CMB. (Courtesy of the NASA/WMAP 
Science Team) 
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Cosmologists Neil Cornish, David Spergel, and Glen Starkman, and math¬ 
ematician Jeffrey Weeks have proposed an approach, called “circles in the 
sky,” for determining the topology of the universe from the CMB tempera¬ 
ture profile. (See [Corl], [Cor2], and [Weel].) Imagine that the universe is 
a 3-torus formed by gluing together opposite faces of a cube. Furthermore, 
imagine that the surface of last scattering expands to the point that its diameter 
exceeds the edge length of the cube, as illustrated in Figure 14.46. When the 
sphere intersects opposite faces in the cube, it causes the sphere to intersect 
itself since the opposite faces are glued together. As shown in the figure, these 
self-intersections result in a collection of circles on the sphere, and when we 
examine the sphere from the inside, each intersection circle can be seen in two 
different locations on the sphere. 



FIGURE 14.46: Circles in the sky result from the sphere intersecting itself. 

The “circles in the sky” idea is to examine the CMB temperature map to 
find pairs of circles around which the temperature distributions are the same. 
Each such pair of circles could represent an intersection of the surface of last 
scattering with itself. The hope is .that the arrangement of intersection circles 
on the surface of last scattering will reveal patterns that can only occur within 
one of the manifolds that is a candidate for the topological shape of the universe. 
In this way, we may be able to identify the manifold in which we live. 

Exercises for Section 14.5 

14.38. If we are in a 2-dimensional universe that is a torus, constructed from a rect¬ 
angle with edge lengths 2 and 4, where do we see peaks in the distribution of 
intergalactic supercluster distances? If, instead, the edge of length 2 on the 
rectangle is glued to its opposite edge with a flip, so that the resultant space is 
a Klein bottle, where do we see the peaks? 

14.39. How do the peaks in the distribution of intergalactic supercluster distances com¬ 
pare between the case where we glue opposite faces of a 3 x 3 x 4 rectangular 
solid to obtain a 3-torus and the case where we glue the faces to obtain a half¬ 
twist manifold, where the twisted face is the one of dimensions 3x3. And how 
do those situations compare with the case where we glue the faces to obtain a 
quarter-twist manifold, where the twisted face is the one of dimensions 3x3? 

14.40. Suppose that the universe is a cube with opposite pairs of faces glued to yield ei¬ 
ther the 3-toms, the quarter-twist manifold, or the half-twist manifold. Explain 
how we could use circles in the sky to distinguish between these possibilities, 
indicating the differences between the circles’ appearance in each case. 






Additional Readings 

Here we list some additional readings related to the topics introduced in the 
text. The topics are listed in the order in which they first appear in the text. 
There is a wealth of literature on many of these topics; we primarily chose 
readings that provide a good starting point or a good overview. 

History of Topology: The compilation [Jam] consists of 40 papers that ad¬ 
dress different topics in the history of topology. The book [Man] is an expo¬ 
sition of the role that concepts and problems in analysis played in the mathe¬ 
matical developments leading to the axiomatic foundation of topology. 

Digital Topology: Along with [Kon], mentioned in the text, other papers that 
initially introduced topological spaces for modeling in digital image processing 
include [Khal], [Kha2], and [Kov]. 

Phenotype Spaces: Besides the paper [Fon 1 ], mentioned in the text, other pa¬ 
pers that address phenotype spaces and continuity in evolution include [Cupl], 
[Cup2], [Fon2], [Stal], and [Sta2]. 

Spatial Relations in Geographic Information Systems: There are many 
papers in the geographic information systems literature addressing topological 
spatial relations. The papers [Sch], [Sha], and [The] are a sample of works that 
are based on the model that is discussed in the text and that was presented orig¬ 
inally in [Ege]. An alternative topological spatial relation model is presented 
...In [Ren], 

Topology and Physics: The texts [Mon], [NasC2], and [Shv], at three dif¬ 
ferent levels of mathematical sophistication, introduce and discuss topics in 
topology and their role in physics. The paper [NasC 1] presents a history of the 
connections between Physics and Topology from the mid-nineteenth century 
to the present. 

The Forward Kinematics Map : The paper [B ak 1 ] is a relatively nontechni¬ 
cal introduction to topological properties of the forward kinematics map. Other 
papers that provide an introductory perspective on this topic include [Bak2], 
[Dem], and [Got], 

Error-Correcting Codes: The texts [Bay] and [Pie] present basic introduc¬ 
tions to the mathematics of error-correcting codes. 

Levenshtein Metric: The book [San] contains a series of papers about the 
Levenshtein metric; it includes applications to DNA sequence analysis, speech 
recognition, error correction, and bird-song analysis. 

Automated Guided Vehicles: The text [Lat] presents a broad introduction 
to the mathematics of robot motion planning. Along with [Abr2], mentioned 
in the text, [Abrl], [Ghrl], and [Ghr2] are a few recent papers that examine 
the structure of configuration spaces for automated guided vehicles. 
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Dynamical Systems and Chaos: The texts [DevR], [All], and [Str] provide 
introductions to various aspects of the theory and applications of dynamical 
systems and chaos. The text [Gil] gives a more advanced treatment of dynam¬ 
ical systems theory from a topological perspective. A nontechnical overview 
of the development of chaos theory from the 1960s into the 1980s can be found 
in [Gle], 

Economics Applications and Game Theory: The book [Von], by John Von 
Neumann and Oskar Morgenstern, is a foundational work on the mathemat¬ 
ics of game theory and economic models. Kakutani’s generalization of the 
Brouwer Fixed Point Theorem was originally published in [Kak], and Nash’s 
equilibrium-existence theorem was first announced in [NasJ], Recent texts 
covering fixed point theory and applications to economics and game theory 
include [Bor] and [Fra]. 

Knot Theory: The Reidemeister moves were presented by Kurt Reidemeis- 
ter in [Reil]; he later published a book on knot theory, [Rei2]. Vaughan Jones 
introduced his polynomial invariant in [Jon], 

The books [Adal], [Cro], and [Liv] are nice follow-ups to our material on 
knot theory; they expand on each of the topics that we present in Chapter 12, 
and they introduce a number of other topics in knot theory. 

Knot Theory Applications: The text [Sum] contains six separate papers 
addressing applications of topology and knot theory in biology, chemistry, and 
physics. The paper [Die] presents the original announcement of the synthesis 
of a knotted molecule. The book [Fla] addresses a number of topics related to 
knots and embeddings of graphs, focusing on their relationship to the structure 
of molecules. 

Topological Graph Theory: The book [Gro] is a nice introduction to topo¬ 
logical graph theory, beginning with basic ideas in graph theory and covering 
many topics related to the topology of graphs. 

Graph Applications in Chemistry: The text [DevJ] is a collection of papers 
addressing the use of topological descriptors in chemistry. The text [Tri] is a 
broad introduction to chemical graph theory, and the text [Bon] is a collection 
of nine separate articles on different aspects of this topic. Also, see the text 
[Fla] mentioned under Knot Theory Applications. 

Graph Applications in Electronic Circuit Design: The publication [Roh] 
is a special issue of the IEEE Transactions on Computer-Aided Design of 
Integrated Circuits and Systems; it is devoted to routing in microelectronics, 
and it consists of 11 papers, most of which employ topological or graph- 
theoretic ideas. The papers [Agg], |Bha], and [Fou] each address applications 
of crossing number or thickness to electronic circuit design or construction. 
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Manifolds and Cosmology: The article [Ada2] is a nontechnical introduc¬ 
tion to manifolds and their application to determining the shape of the universe. 
The paper [Thu2] presents a brief introduction to a variety of mathematical 
ideas in the study of manifolds. The book [Wee2] is an excellent introduction 
to the geometry and topology of surfaces and 3-manifolds, while [Thul] is a 
more advanced text that focuses on 3-manifolds. 
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tic-tac-toe, 114 

Tietze Extension Theorem, 243 
topoisomerase II, 400 
topological conjugacy, 261 
topological graph, 110,408 
is compact, 233 
topological property, 151 
topological space, 42 
topological transitivity, 274 
when sufficient for chaos, 289 
topologically equivalent, 141 
informal use, 94 
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coherently oriented, 449 
equivalence, 447 
subdivision, 447 
trivial knot, 382 

trivial link of two components, 391 
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